Make Good Use of GujiRoBERTa to Identify Entities in Ancient Chinese

Lihan Lin, Yiming Wang, Jiachen Li, Huan Ouyang, Si Li


Abstract
This report describes our model submitted for the EvaHan 2025 shared task on named entity recognition for ancient Chinese literary works. Since we participated in the task of closed modality, our method is based on the appointed pretrained language model GujiRoBERTajian-fan and we used appointed datasets.We carried out experiments on decodingstrategies and schedulers to verify the effect of our method. In the final test, our method outperformed the official baseline, demonstrating its effectiveness. In the end, for the results, this report gives an analysis from the perspective of data composition.
Anthology ID:
2025.alp-1.23
Volume:
Proceedings of the Second Workshop on Ancient Language Processing
Month:
May
Year:
2025
Address:
The Albuquerque Convention Center, Laguna
Editors:
Adam Anderson, Shai Gordin, Bin Li, Yudong Liu, Marco C. Passarotti, Rachele Sprugnoli
Venues:
ALP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
182–186
Language:
URL:
https://preview.aclanthology.org/Author-page-Marten-During-lu/2025.alp-1.23/
DOI:
Bibkey:
Cite (ACL):
Lihan Lin, Yiming Wang, Jiachen Li, Huan Ouyang, and Si Li. 2025. Make Good Use of GujiRoBERTa to Identify Entities in Ancient Chinese. In Proceedings of the Second Workshop on Ancient Language Processing, pages 182–186, The Albuquerque Convention Center, Laguna. Association for Computational Linguistics.
Cite (Informal):
Make Good Use of GujiRoBERTa to Identify Entities in Ancient Chinese (Lin et al., ALP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/Author-page-Marten-During-lu/2025.alp-1.23.pdf