您现在的位置是：首页 > 数据库

当前栏目

Oracle 两个逗号分割的字符串，使用regexp_substr和regexp_replace等实现获取交集、差集的sql实现过程解析

Oracle SQL 实现获取解析字符串过程两个

2023-09-11 14:21:08 时间

Oracle数据库的两个字段值为逗号分割的字符串，例如：字段A值为“1,2,3,5”，字段B为“2”。
想获取两个字段的交集（相同值）2，获取两个字段的差集（差异值）1,3,5。

一、最终实现的sql语句

1、获取交集（相同值）：

select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '1,2,3,5' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1
intersect -- 取交集
select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '2' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1;
/*结果：
2
*/

2、获取差集（差异值）：

select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '1,2,3,5' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1
minus --取差集
select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '2' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1;
/*结果：
1
3
5
*/

二、实现过程用到的函数用法说明

1、regexp_substr
正则表达式分割字符串，函数格式如下：

function regexp_substr(strstr, pattern [,position] [,occurrence] [,modifier] [subexpression])
__srcstr：需要进行正则处理的字符串
__pattern：进行匹配的正则表达式
__position：可选参数，表示起始位置，从第几个字符开始正则表达式匹配（默认为1）
__occurrence：可选参数，标识第几个匹配组，默认为1
__modifier：可选参数，表示模式（'i'不区分大小写进行检索；'c'区分大小写进行检索。默认为'c'。）

使用例子：

select 
regexp_substr('1,2,3,5','[^,]+') AS t1, 
regexp_substr('1,2,3,5','[^,]+',1,2) AS t2,
regexp_substr('1,2,3,5','[^,]+',1,3) AS t3,
regexp_substr('1,2,3,5','[^,]+',1,4) AS t4,
regexp_substr('1,2,3,5','[^,]+',2) AS t5,
regexp_substr('1,2,3,5','[^,]+',2,1) AS t6,
regexp_substr('1,2,3,5','[^,]+',2,2) AS t7
from dual; 
/*结果：
1    2    3    5    2    2    3
*/

2、regexp_replace

通过正则表达式来进行匹配替换，函数格式如下：

function regexp_substr(srcstr, pattern [,replacestr] [,position] [,occurrence] [,modifier])
__srcstr：需要进行正则处理的字符串
__pattern：进行匹配的正则表达式
__replacestr：可选参数，替换的字符串，默认为空字符串
__position：可选参数，表示起始位置，从第几个字符开始正则表达式匹配（默认为1）
__occurrence：可选参数，标识第几个匹配组，默认为1
__modifier：可选参数，表示模式（'i'不区分大小写进行检索；'c'区分大小写进行检索。默认为'c'。）

使用例子1：

select 
regexp_replace('1,2,3,5','5','4') t1,
regexp_replace('1,2,3,5','2|3',4) t2,
regexp_replace('1,2,3,5','[^,]+') t3,
regexp_replace('1,2,3,5','[^,]+','') t4,
regexp_replace('1,2,3,5','[^,]+','*') t5
from dual; 
/*结果：
1,2,3,4    1,4,4,5    ,,,    ,,,    *,*,*,*
*/

使用例子2（截取字符串中的指定字符）：

select  
regexp_replace('同意（72小时自动确认）--张三(2015-01-02 08:50:13);不同意。说明--李四(2022-01-20 12:20:17);同意。测试。--王五(2022-01-20 13:20:28);','(\d)|(不?同意\S*--)|(小时自动确认)|[。（）(): -]','') res
from dual;
/*
结果：
张三;李四;王五;
*/

使用例子3（截取字符串中的指定字符）：

/*说明：
json内容：[{"advantage":"未知","disadvantage":"未知","unitName":"XX公司","unitRemark":""},{"advantage":"未知","disadvantage":"未知","unitName":"YY公司","unitRemark":""}]
正则说明：* 和 + 限定符都是贪婪的，它们会尽可能多的匹配文字，在它们后面加上一个 ? 就可以实现非贪婪或最小匹配
*/
select 
regexp_replace(json, '(\{"advantage\S*?unitName":")|(","unitRemark\S*?\})|\[|\]', '') 
from tb
/*
结果：XX公司,YY公司
*/

使用例子4（替换字符串中的日期）：

select '测试 (2022-09-01)', regexp_replace('测试 (2022-09-01)', '(\d{4}-\d{2}-\d{2})', to_char(sysdate,'yyyy-mm-dd')) from dual;
/*结果：
测试 (2022-09-01)      测试 (2022-09-29)
*/

3、connect by

（1）connect by单独用，返回多行结果

select rownum from dual connect by rownum < 5;
/*结果：
1
2
3
4
*/

（2）一般通过start with . . . connect by . . .子句来实现SQL的层次查询

select 
id,
name,
sys_connect_by_path(id,'\') idpath,
sys_connect_by_path(name, '\') namepath
from (
select 1 id, '广东' name, 0 pid from dual
union 
select 2 id, '广州' name , 1 pid from dual
union 
select 3 id, '深圳' name , 1 pid from dual
) 
start with pid = 0
connect by prior id = pid;

/*结果：
1    广东    \1    \广东
2    广州    \1\2    \广东\广州
3    深圳    \1\3    \广东\深圳
*/

三、总结

由上面函数用法，可知下面语句可以把字符串“1,2,3,5”转换为4行记录

select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '1,2,3,5' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1

然后在2个结果中使用集合运算符（UNION/UNION ALL 并集，INTERSECT 交集，MINUS 差集）进行最终处理。

猜你喜欢

python之psutil模块（获取系统性能信息（CPU,内存，磁盘，网络）
STM 软件事务内存——本质是为提高并发，通过事务来管理内存的读写访问以避免锁的使用
Java实现蓝桥杯VIP 算法训练猴子分苹果
K8s的Service详解
【17.00%】【codeforces 621D】Rat Kwesh and Cheese
AndroidThings系列学习
有关自动目视解译系统的假设
【CSS】盒子模型外边距 ② ( 盒子模型水平居中 | 盒子内文字、行内元素、行内块元素居中对齐 )
LASlib 读写点云
scp复制文件到远程服务器上
风光场景削减及源荷不确定性的虚拟电厂随机优化调度研究（Matlab代码实现）
Linux系统中安装mysql注意事项
iptables防火墙与日志系统配合使用监控服务器特点端口的防问源IP
Arduino Nano + WIZ550io = 简易上网
[Javascript AST] 3. Continue: Write ESLint rule
like 和 contains 和match() against()
Open3D(C++) 计算点云包围盒
4.4 CUDA prefix sum一步一步优化
.NET平台开源项目速览(7)关于NoSQL数据库LiteDB的分页查询解决过程
对CAB文件进行数字签名
Qt QCheckBox QRadioButton
JDK1.8 新特性

相关主题

oracle sql命令
Oracle_常用SQL
oracle集群
nvl函数 oracle
SQL查询效率(Oracle)
oracle sql语句
Oracle 语句大全
oracle之回顾二
cx_Oracle
oracle sql优化

zl程序教程

当前栏目

Oracle 两个逗号分割的字符串，使用regexp_substr和regexp_replace等实现获取交集、差集的sql实现过程解析

相关文章