Exploiting Hierarchically Structured Categories in Fine-grained Chinese Named Entity Recognition

Jiuding Yang, Jinwen Luo, Weidong Guo, Di Niu, Yu Xu


Abstract
Chinese Named Entity Recognition (CNER) is a widely used technology in various applications. While recent studies have focused on utilizing additional information of the Chinese language and characters to enhance CNER performance, this paper focuses on a specific aspect of CNER known as fine-grained CNER (FG-CNER). FG-CNER involves the use of hierarchical, fine-grained categories (e.g. Person-MovieStar) to label named entities. To promote research in this area, we introduce the FiNE dataset, a dataset for FG-CNER consisting of 30,000 sentences from various domains and containing 67,651 entities in 54 fine-grained flattened hierarchical categories. Additionally, we propose SoftFiNE, a novel approach for FG-CNER that utilizes a custom-designed relevance scoring function based on label structures to learn the potential relevance between different flattened hierarchical labels. Our experimental results demonstrate that the proposed SoftFiNE method outperforms the state-of-the-art baselines on the FiNE dataset. Furthermore, we conduct extensive experiments on three other datasets, including OntoNotes 4.0, Weibo, and Resume, where SoftFiNE achieved state-of-the-art performance on all three datasets.
Anthology ID:
2023.findings-acl.211
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3407–3421
Language:
URL:
https://aclanthology.org/2023.findings-acl.211
DOI:
10.18653/v1/2023.findings-acl.211
Bibkey:
Cite (ACL):
Jiuding Yang, Jinwen Luo, Weidong Guo, Di Niu, and Yu Xu. 2023. Exploiting Hierarchically Structured Categories in Fine-grained Chinese Named Entity Recognition. In Findings of the Association for Computational Linguistics: ACL 2023, pages 3407–3421, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Exploiting Hierarchically Structured Categories in Fine-grained Chinese Named Entity Recognition (Yang et al., Findings 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/2023.findings-acl.211.pdf