A UD Parser to the Rescue: A Method for Bringing a Classical Annotated Corpus to Life Again

Lucelene Lopes, Magali S. Duran, Thiago A. S. Pardo


Abstract
This paper reports on an effort to recover the classical morphosyntactically annotated corpus MacMorpho and realign it with the current version of the Universal Dependencies framework. We introduce a knowledge-rich approach grounded in a syntactic parser and on a specially designed tagset compatibility strategy in order to generate a "silver-standard" resource: the MacMorpho-UD-2.17. We evaluate this resource through multiple complementary methods, providing evidence for the quality of both our approach and the resulting annotation.
Anthology ID:
2026.propor-1.24
Volume:
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Month:
April
Year:
2026
Address:
Salvador, Brazil
Editors:
Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:
PROPOR
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
240–249
Language:
URL:
https://preview.aclanthology.org/ingest-dnd/2026.propor-1.24/
DOI:
Bibkey:
Cite (ACL):
Lucelene Lopes, Magali S. Duran, and Thiago A. S. Pardo. 2026. A UD Parser to the Rescue: A Method for Bringing a Classical Annotated Corpus to Life Again. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1, pages 240–249, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):
A UD Parser to the Rescue: A Method for Bringing a Classical Annotated Corpus to Life Again (Lopes et al., PROPOR 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-dnd/2026.propor-1.24.pdf