Anaphoric Phenomena in Situated dialog: A First Round of Annotations

Sharid Loáiciga, Simon Dobnik, David Schlangen


Abstract
We present a first release of 500 documents from the multimodal corpus Tell-me-more (Ilinykh et al., 2019) annotated with coreference information according to the ARRAU guidelines (Poesio et al., 2021). The corpus consists of images and short texts of five sentences. We describe the annotation process and present the adaptations to the original guidelines in order to account for the challenges of grounding the annotations to the image. 50 documents from the 500 available are annotated by two people and used to estimate inter-annotator agreement (IAA) relying on Krippendorff’s alpha.
Anthology ID:
2022.crac-1.4
Volume:
Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Venue:
CRAC
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
31–37
Language:
URL:
https://aclanthology.org/2022.crac-1.4
DOI:
Bibkey:
Cite (ACL):
Sharid Loáiciga, Simon Dobnik, and David Schlangen. 2022. Anaphoric Phenomena in Situated dialog: A First Round of Annotations. In Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, pages 31–37, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
Anaphoric Phenomena in Situated dialog: A First Round of Annotations (Loáiciga et al., CRAC 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2022.crac-1.4.pdf