Simple Named Entity Recognition (NER) System with RoBERTa for Ancient Chinese

Yunmeng Zhang, Meiling Liu, Hanqi Tang, Shige Lu, Lang Xue


Abstract
Named Entity Recognition (NER) is a fun-damental task in Natural Language Process-ing (NLP), particularly in the analysis of Chi-nese historical texts. In this work, we pro-pose an innovative NER model based on Gu-jiRoBERTa, incorporating Conditional Ran-dom Fields (CRF) and Long Short Term Mem-ory Network(LSTM) to enhance sequence la-beling performance. Our model is evaluated on three datasets from the EvaHan2025 competi-tion, demonstrating superior performance over the baseline model, SikuRoBERTa-BiLSTM-CRF. The proposed approach effectively cap-tures contextual dependencies and improves entity boundary recognition. Experimental re-sults show that our method achieves consistent improvements across almost all evaluation met-rics, highlighting its robustness and effective-ness in handling ancient Chinese texts.
Anthology ID:
2025.alp-1.27
Volume:
Proceedings of the Second Workshop on Ancient Language Processing
Month:
May
Year:
2025
Address:
The Albuquerque Convention Center, Laguna
Editors:
Adam Anderson, Shai Gordin, Bin Li, Yudong Liu, Marco C. Passarotti, Rachele Sprugnoli
Venues:
ALP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
206–212
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.alp-1.27/
DOI:
Bibkey:
Cite (ACL):
Yunmeng Zhang, Meiling Liu, Hanqi Tang, Shige Lu, and Lang Xue. 2025. Simple Named Entity Recognition (NER) System with RoBERTa for Ancient Chinese. In Proceedings of the Second Workshop on Ancient Language Processing, pages 206–212, The Albuquerque Convention Center, Laguna. Association for Computational Linguistics.
Cite (Informal):
Simple Named Entity Recognition (NER) System with RoBERTa for Ancient Chinese (Zhang et al., ALP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.alp-1.27.pdf