CAN-NER Convolutional Attention Network for Chinese Named Entity Recognition

mac2025-03-20 27

作者提出了基于注意力机制的卷积神经网络架构，用于中文命名实体识别。

主要的框架是CNN with the local-attention 和Bi-GRU with global self-attention

总体的框架图如下：

字符的嵌入输入 $x$

local attention步骤

$\in R^{d_h} W_1; W_2 \in R^{{d_h};de}$

卷积步骤

$h^c_j = \sum_k[W^c ∗ h_{j-\frac{k-1}{2},...,j-\frac{k+1}{2}}+b_c]$

典型的卷积操作，只不过是最后sum pooling layer

跟BiLSTM+CRF没有什么区别，主要是加了一个中间加了一个global attention

与上面的local attention类似，只不过范围不再是cnn的windows size，而是针对

整个序列

最新回复(0)