When Less Is More: Logits-Constrained Framework with RoBERTa for Ancient Chinese NER

Wenjie Hua, Shenghan Xu


Abstract
This report presents our team’s work on ancient Chinese Named Entity Recognition (NER) for EvaHan 20251. We propose a two-stage framework combining GujiRoBERTa with a Logits-Constrained (LC) mechanism. The first stage generates contextual embeddings using GujiRoBERTa, followed by dynamically masked decoding to enforce valid BMES transitions. Experiments on EvaHan 2025 datasets demonstrate the framework’s effectiveness. Key findings include the LC framework’s superiority over CRFs in high-label scenarios and the detrimental effect of BiLSTM modules. We also establish empirical model selection guidelines based on label complexity and dataset size.
Anthology ID:
2025.alp-1.25
Volume:
Proceedings of the Second Workshop on Ancient Language Processing
Month:
May
Year:
2025
Address:
The Albuquerque Convention Center, Laguna
Editors:
Adam Anderson, Shai Gordin, Bin Li, Yudong Liu, Marco C. Passarotti, Rachele Sprugnoli
Venues:
ALP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
192–196
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.alp-1.25/
DOI:
Bibkey:
Cite (ACL):
Wenjie Hua and Shenghan Xu. 2025. When Less Is More: Logits-Constrained Framework with RoBERTa for Ancient Chinese NER. In Proceedings of the Second Workshop on Ancient Language Processing, pages 192–196, The Albuquerque Convention Center, Laguna. Association for Computational Linguistics.
Cite (Informal):
When Less Is More: Logits-Constrained Framework with RoBERTa for Ancient Chinese NER (Hua & Xu, ALP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.alp-1.25.pdf