Evaluation Framework for Layered Meaning Representation

Rémi de Vergnette, Maxime Amblard, Bruno Guillaume


Abstract
We propose different modular evaluation metrics for Layered Meaning Representation, defined as YARN, a semantic formalism encoded using rich structures that generalize AMR graphs. While existing metrics like SMATCH evaluate graph-based semantic representations such as AMR, they cannot directly handle YARN’s more complex structures. We make full use of the modular nature of YARN to propose two families of metrics, depending on the linguistic features and type of semantic phenomenon targeted. The first one, SMATCHY, extends the AMR SMATCH metric. We also propose YARNBLEU, based on the SEMBLEU metric for AMR. We evaluate both families on a small dataset of human annotated YARN structures, adding random modifications simulating annotation mistakes and show that SMATCHY provides a more consistent and reliable approach with respect to the type of modifications considered.
Anthology ID:
2025.dmr-1.5
Volume:
Proceedings of the Sixth International Workshop on Designing Meaning Representations
Month:
August
Year:
2025
Address:
Prague, Czechia
Editors:
Kenneth Lai, Shira Wein
Venues:
DMR | WS
SIG:
Publisher:
Association for Computational Lingustics
Note:
Pages:
38–48
Language:
URL:
https://preview.aclanthology.org/gwc-25-ingestion/2025.dmr-1.5/
DOI:
Bibkey:
Cite (ACL):
Rémi de Vergnette, Maxime Amblard, and Bruno Guillaume. 2025. Evaluation Framework for Layered Meaning Representation. In Proceedings of the Sixth International Workshop on Designing Meaning Representations, pages 38–48, Prague, Czechia. Association for Computational Lingustics.
Cite (Informal):
Evaluation Framework for Layered Meaning Representation (de Vergnette et al., DMR 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/gwc-25-ingestion/2025.dmr-1.5.pdf