您现在的位置是：首页 > 其他

当前栏目

论文阅读《Joint Extraction of Entities and Relations Based on a Novel Decompostion Strategy》

On 论文 and of 阅读 based Strategy

2023-09-14 09:13:20 时间

前言

文章来源：LawsonAbs（CSDN）
望各位读者审慎阅读

1.之前的问题

因为之前的信息抽取模型使用的方法是先抽取再分类，或者是一个联合标注的方法，这些方法存在的问题就是：在抽取实体和关系的时候忽略了内部的结构；同时也会有产生大量的冗余的实体对。

2.解决办法

将联合抽取任务分解成两个子任务：HE extraction 和 TER extraction。第一个子任务是为了抽取所有可能存在关系对的头实体；第二个子任务是为了抽取在第一个任务得到的头实体对应的尾实体和关系。

那么具体该怎么做上面这两个任务呢？具体的方法就是：

使用作者提出的 span-based tagging scheme，将这两个任务分解成数个序列标注来完成

原话是这么说的：

Next, these two subtasks are further deconstructed into several sequence labeling problems based on our proposed span-based tagging scheme, which are conveniently solved by a hierarchical boundary tagger and a multi-span decoding algorithm.

这里的意思就是：这两个任务都可以通过一个hierarchical boundary tagger and a mulit-span docding algorithm解决。

3. 解码算法

接下来聊聊整个模型的解码算法。
其实我之前是对解码算法不怎么理解的，因为我一直觉得：有什么样的tagging方式，就对应什么样的解码操作不就行了吗？难道还要专门写一个解码算法吗？对，还需要专门写个解码算法，因为复杂的编码结构就必须对应一个简要的解码方式，否则就是一篇不充分的论文。以编码的方式去解码这个想法应该是正确的，这篇文章也以伪代码的方式给出了解码的方式，如下所示：
在这里插入图片描述
这里的解码部分主要是13 - 20行，其主要的思想就是：按照最近邻匹配的方式解码。

4.问题

4.1 `a novel decomposition strategy` 中的 `novel` 体现在什么地方？

这里的novel 修饰的是decomposition，说明的问题就是：分解的方式很novel，因为这里分成了两部分，第一部分是 head entity的抽取，第二部分是end entity + relation的抽取。

4.2 为什么要 `decompostion`？ `decomposition` 能带来什么？

为了噱头

4.2 怎么做到“范式不再在第一步提取所有实体，只识别可能参与目标三元组的头实体”？

4.3 文章中说的`hierarchical boundary tagger`是什么？

先找出文中的提及到的几处 hierarchical

To enhance the association between boundary positions, we present a hierarchical boundary tagger, which labels the start and end positions separately in a cascade structure and decode them together by a multi-span decoding algorithm.

the hierarchical boundary tagger, which is the basic labeling module in our method.

According to our tagging scheme, we utilize a unified architecture to extract HE and TER. In this paper, we wrap such extractor into a general module named hierarchical boundary tagger (abbreviated as HBT).

Such decomposition indicates that there is a natural order among the tasks: predicting end positions may benefit from the prediction results of start positions, which motivates us to employ a hierarchical tagging structure.

这句话读着有点儿难受，识别头实体的end难道竟然可以从头实体的start中受益？
下面来看一下HBT的基本结构图：
在这里插入图片描述

根据上图可以看出，这个模型是存在一个层级，所以叫做hierarchical。同时可以看到这里的 Start Tags 的标注结果会传到End Tags 模块中使用。

猜你喜欢

MySQL Error number: MY-010020; Symbol: ER_DD_INIT_FAILED; SQLSTATE: HY000 报错故障修复远程处理
榜单发布 | WitAwards 2022中国网络安全行业年度评选结果
通道分离与合并、彩色图转换为灰度图、二值化
如何搭建完备的 Linux 编译环境（linux编译环境搭建）
使用vi编辑在Linux系统中的文件（linux中vi）
在Java中常见的数据类型有哪些？「建议收藏」
Redis在NoSQL时代的重要作用（nosql与redis）
探索 Linux 内核引导的奥秘（linux内核引导）
db2v8的pdf文档资料
开源Fast R-CNN代码实现物体识别[通俗易懂]
神还原物体复杂、高频细节，4K-NeRF高保真视图合成来了
网红美食广场，如何打破地域文化的限制
站Oracle学习路上，一路向前！（oracle学习网）
ORA-24326: handle passed in is already initialized ORACLE 报错故障修复远程处理
未来哪些行业值得加入？
ORA-13124: unable to determine column id for column string ORACLE 报错故障修复远程处理
JMeter扩展开发：自定义函数
配置【Linux如何查询硬件配置】（linux查询硬件）
JavaScript实现MySQL数据库查询（js查询mysql数据库）
BufferedWriter 和 BufferedReader 的基本用法
ORA-44415: Invalid ACL: Undefined privileges ORACLE 报错故障修复远程处理

相关主题

jQuery on()方法
jquery on
jQuery .on
sql left join on
hive on spark的坑
v-on
v-on指令
v - on
on使用详解
v-bind和v-on

zl程序教程

当前栏目

论文阅读《Joint Extraction of Entities and Relations Based on a Novel Decompostion Strategy》

前言

1.之前的问题

2.解决办法

3. 解码算法

4.问题

4.1 `a novel decomposition strategy` 中的 `novel` 体现在什么地方？

4.2 为什么要 `decompostion`？ `decomposition` 能带来什么？

4.2 怎么做到“范式不再在第一步提取所有实体，只识别可能参与目标三元组的头实体”？

4.3 文章中说的`hierarchical boundary tagger`是什么？

相关文章

当前栏目

论文阅读《Joint Extraction of Entities and Relations Based on a Novel Decompostion Strategy》

前言

1.之前的问题

2.解决办法

3. 解码算法

4.问题

4.1 a novel decomposition strategy 中的 novel 体现在什么地方？

4.2 为什么要 decompostion？ decomposition 能带来什么？

4.2 怎么做到“范式不再在第一步提取所有实体，只识别可能参与目标三元组的头实体”？

4.3 文章中说的hierarchical boundary tagger是什么？

相关文章

4.1 `a novel decomposition strategy` 中的 `novel` 体现在什么地方？

4.2 为什么要 `decompostion`？ `decomposition` 能带来什么？

4.3 文章中说的`hierarchical boundary tagger`是什么？