Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset

Reza Takhshid; Tara Azin; Razieh Shojaei; Mohammad Bahrani

Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset

Reza Takhshid, Tara Azin, Razieh Shojaei, Mohammad Bahrani

Abstract

This paper introduces the Persian Abstract Meaning Representation (AMR) guidelines, a detailed guide for annotating Persian sentences with AMR, focusing on the necessary adaptations to fit Persian’s unique syntactic structures. We discuss the development process of a Persian AMR gold standard dataset consisting of 1562 sentences created following the guidelines. By examining the language specifications and nuances that distinguish AMR annotations of a low-resource language like Persian, we shed light on the challenges and limitations of developing a universal meaning representation framework. The guidelines and the dataset introduced in this study highlight such challenges, aiming to advance the field.

Anthology ID:: 2024.umrpw-1.2
Volume:: Proceedings of the 2024 UMR Parsing Workshop
Month:: June
Year:: 2024
Address:: Boulder, Colorado
Editors:: Nianwen Xue, James Martin
Venues:: UMRPW | WS
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8–15
Language:
URL:: https://preview.aclanthology.org/landing_page/2024.umrpw-1.2/
DOI:
Bibkey:
Cite (ACL):: Reza Takhshid, Tara Azin, Razieh Shojaei, and Mohammad Bahrani. 2024. Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset. In Proceedings of the 2024 UMR Parsing Workshop, pages 8–15, Boulder, Colorado. Association for Computational Linguistics.
Cite (Informal):: Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset (Takhshid et al., UMRPW 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/landing_page/2024.umrpw-1.2.pdf

PDF Cite Search Fix data