Multilingual Grammatical Error Annotation: Combining Language-Agnostic Framework with Language-Specific Flexibility
Mengyang Qiu, Tran Minh Nguyen, Zihao Huang, Zelong Li, Yang Gu, Qingyu Gao, Siliang Liu, Jungyeul Park
Abstract
Grammatical Error Correction (GEC) relies on accurate error annotation and evaluation, yet existing frameworks, such as errant, face limitations when extended to typologically diverse languages. In this paper, we introduce a standardized, modular framework for multilingual grammatical error annotation. Our approach combines a language-agnostic foundation with structured language-specific extensions, enabling both consistency and flexibility across languages. We reimplement errant using stanza to support broader multilingual coverage, and demonstrate the framework’s adaptability through applications to English, German, Czech, Korean, and Chinese, ranging from general-purpose annotation to more customized linguistic refinements. This work supports scalable and interpretable GEC annotation across languages and promotes more consistent evaluation in multilingual settings. The complete codebase and annotation tools can be accessed at https://github.com/open-writing-evaluation/jp_errant_bea.
- Anthology ID:
- 2025.bea-1.15
- Volume:
- Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
- Month:
- July
- Year:
- 2025
- Address:
- Vienna, Austria
- Editors:
- Ekaterina Kochmar, Bashar Alhafni, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anaïs Tack, Victoria Yaneva, Zheng Yuan
- Venues:
- BEA | WS
- Publisher:
- Association for Computational Linguistics
- Pages:
- 202–212
- URL:
- https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bea-1.15/
- Cite (ACL):
- Mengyang Qiu, Tran Minh Nguyen, Zihao Huang, Zelong Li, Yang Gu, Qingyu Gao, Siliang Liu, and Jungyeul Park. 2025. Multilingual Grammatical Error Annotation: Combining Language-Agnostic Framework with Language-Specific Flexibility. In Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025), pages 202–212, Vienna, Austria. Association for Computational Linguistics.
- Cite (Informal):
- Multilingual Grammatical Error Annotation: Combining Language-Agnostic Framework with Language-Specific Flexibility (Qiu et al., BEA 2025)
- PDF:
- https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bea-1.15.pdf