Spanish Abstract Meaning Representation: Annotation of a General Corpus
Shira Wein, Lucia Donatelli, Ethan Ricker, Calvin Engstrom, Alex Nelson, Leonie Harter, Nathan Schneider
Abstract
Abstract Meaning Representation (AMR), originally designed for English, has been adapted to a number of languages to facilitate cross-lingual semantic representation and analysis. We build on previous work and present the first sizable, general annotation project for Spanish AMR. We release a detailed set of annotation guidelines and a corpus of 486 gold-annotated sentences spanning multiple genres from an existing, cross-lingual AMR corpus. Our work constitutes the second-largest non-English gold AMR corpus to date. Fine-tuning an AMR-to-Spanish generation model with our annotations results in a BERTScore improvement of 8.8%, demonstrating initial utility of our work.
- Anthology ID:
- 2022.nejlt-1.6
- Volume:
- Northern European Journal of Language Technology, Volume 8
- Year:
- 2022
- Address:
- Copenhagen, Denmark
- Venue:
- NEJLT
- Publisher:
- Northern European Association of Language Technology
- URL:
- https://aclanthology.org/2022.nejlt-1.6
- DOI:
- https://doi.org/10.3384/nejlt.2000-1533.2022.4462
- Cite (ACL):
- Shira Wein, Lucia Donatelli, Ethan Ricker, Calvin Engstrom, Alex Nelson, Leonie Harter, and Nathan Schneider. 2022. Spanish Abstract Meaning Representation: Annotation of a General Corpus. In Northern European Journal of Language Technology, Volume 8, Copenhagen, Denmark. Northern European Association of Language Technology.
- Cite (Informal):
- Spanish Abstract Meaning Representation: Annotation of a General Corpus (Wein et al., NEJLT 2022)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/2022.nejlt-1.6.pdf