Multi-Strategy Named Entity Recognition System for Ancient Chinese

Wenxuan Dong, Meiling Liu


Abstract
We present a multi-strategy Named Entity Recognition (NER) system for ancient Chi-nese texts in EvaHan2025. Addressing dataset heterogeneity, we use a Conditional Random Field (CRF) for Tasks A and C to handle six entity types’ complex dependencies, and a lightweight Softmax classifier for Task B’s simpler three-entity tagset. Ablation studies on training data confirm CRF’s superiority in capturing sequence dependencies and Softmax’s computational advantage for simpler tasks. On blind tests, our system achieves F1-scores of 83.94%, 88.31%, and 82.15% for Test A, B, and C—outperforming baselines by 2.46%, 0.81%, and 9.75%. With an overall F1 improvement of 4.30%, it excels across historical and medical domains. This adaptability enhances knowledge extraction from ancient texts, offering a scalable NER framework for low-resource, complex languages.
Anthology ID:
2025.alp-1.28
Volume:
Proceedings of the Second Workshop on Ancient Language Processing
Month:
May
Year:
2025
Address:
The Albuquerque Convention Center, Laguna
Editors:
Adam Anderson, Shai Gordin, Bin Li, Yudong Liu, Marco C. Passarotti, Rachele Sprugnoli
Venues:
ALP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
213–220
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.alp-1.28/
DOI:
Bibkey:
Cite (ACL):
Wenxuan Dong and Meiling Liu. 2025. Multi-Strategy Named Entity Recognition System for Ancient Chinese. In Proceedings of the Second Workshop on Ancient Language Processing, pages 213–220, The Albuquerque Convention Center, Laguna. Association for Computational Linguistics.
Cite (Informal):
Multi-Strategy Named Entity Recognition System for Ancient Chinese (Dong & Liu, ALP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.alp-1.28.pdf