Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset

Reza Takhshid, Tara Azin, Razieh Shojaei, Mohammad Bahrani


Abstract
This paper introduces the Persian Abstract Meaning Representation (AMR) guidelines, a detailed guide for annotating Persian sentences with AMR, focusing on the necessary adaptations to fit Persian’s unique syntactic structures. We discuss the development process of a Persian AMR gold standard dataset consisting of 1562 sentences created following the guidelines. By examining the language specifications and nuances that distinguish AMR annotations of a low-resource language like Persian, we shed light on the challenges and limitations of developing a universal meaning representation framework. The guidelines and the dataset introduced in this study highlight such challenges, aiming to advance the field.
Anthology ID:
2024.umrpw-1.2
Volume:
Proceedings of the 2024 UMR Parsing Workshop
Month:
June
Year:
2024
Address:
Boulder, Colorado
Editors:
Nianwen Xue, James Martin
Venues:
UMRPW | WS
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
8–15
Language:
URL:
https://preview.aclanthology.org/landing_page/2024.umrpw-1.2/
DOI:
Bibkey:
Cite (ACL):
Reza Takhshid, Tara Azin, Razieh Shojaei, and Mohammad Bahrani. 2024. Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset. In Proceedings of the 2024 UMR Parsing Workshop, pages 8–15, Boulder, Colorado. Association for Computational Linguistics.
Cite (Informal):
Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset (Takhshid et al., UMRPW 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2024.umrpw-1.2.pdf