Question-parsing with Abstract Meaning Representation enhanced by adding small datasets

Johannes Heinecke, Maria Boritchev, Frédéric Herledan


Abstract
Abstract Meaning Representation (AMR) is a graph-based formalism for representing meaning in sentences. As the annotation is quite complex, few annotated corpora exist. The most well-known and widely-used corpora are LDC’s AMR 3.0 and the datasets available on the new AMR website. Models trained on the LDC corpora work fine on texts with similar genre and style: sentences extracted from news articles, Wikipedia articles. However, other types of texts, in particular questions, are less well processed by models trained on this data. We analyse how adding few sentence-type specific annotations can steer the model to improve parsing in the case of questions in English.
Anthology ID:
2025.nodalida-1.26
Volume:
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
Month:
march
Year:
2025
Address:
Tallinn, Estonia
Editors:
Richard Johansson, Sara Stymne
Venue:
NoDaLiDa
SIG:
Publisher:
University of Tartu Library
Note:
Pages:
252–257
Language:
URL:
https://preview.aclanthology.org/corrections-2025-06/2025.nodalida-1.26/
DOI:
Bibkey:
Cite (ACL):
Johannes Heinecke, Maria Boritchev, and Frédéric Herledan. 2025. Question-parsing with Abstract Meaning Representation enhanced by adding small datasets. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pages 252–257, Tallinn, Estonia. University of Tartu Library.
Cite (Informal):
Question-parsing with Abstract Meaning Representation enhanced by adding small datasets (Heinecke et al., NoDaLiDa 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/corrections-2025-06/2025.nodalida-1.26.pdf