Cross-domain Named Entity Recognition via Graph Matching

Junhao Zheng, Haibin Chen, Qianli Ma


Abstract
Cross-domain NER is a practical yet challenging problem since the data scarcity in the real-world scenario. A common practice is first to learn a NER model in a rich-resource general domain and then adapt the model to specific domains. Due to the mismatch problem between entity types across domains, the wide knowledge in the general domain can not effectively transfer to the target domain NER model. To this end, we model the label relationship as a probability distribution and construct label graphs in both source and target label spaces. To enhance the contextual representation with label structures, we fuse the label graph into the word embedding output by BERT. By representing label relationships as graphs, we formulate cross-domain NER as a graph matching problem. Furthermore, the proposed method has good applicability with pre-training methods and is potentially capable of other cross-domain prediction tasks. Empirical results on four datasets show that our method outperforms a series of transfer learning, multi-task learning, and few-shot learning methods.
Anthology ID:
2022.findings-acl.210
Volume:
Findings of the Association for Computational Linguistics: ACL 2022
Month:
May
Year:
2022
Address:
Dublin, Ireland
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2670–2680
Language:
URL:
https://aclanthology.org/2022.findings-acl.210
DOI:
10.18653/v1/2022.findings-acl.210
Bibkey:
Cite (ACL):
Junhao Zheng, Haibin Chen, and Qianli Ma. 2022. Cross-domain Named Entity Recognition via Graph Matching. In Findings of the Association for Computational Linguistics: ACL 2022, pages 2670–2680, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Cross-domain Named Entity Recognition via Graph Matching (Zheng et al., Findings 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2022.findings-acl.210.pdf
Software:
 2022.findings-acl.210.software.zip
Data
CrossNER