Abstract
We present a first release of 500 documents from the multimodal corpus Tell-me-more (Ilinykh et al., 2019) annotated with coreference information according to the ARRAU guidelines (Poesio et al., 2021). The corpus consists of images and short texts of five sentences. We describe the annotation process and present the adaptations to the original guidelines in order to account for the challenges of grounding the annotations to the image. 50 documents from the 500 available are annotated by two people and used to estimate inter-annotator agreement (IAA) relying on Krippendorff’s alpha.- Anthology ID:
- 2022.crac-1.4
- Volume:
- Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference
- Month:
- October
- Year:
- 2022
- Address:
- Gyeongju, Republic of Korea
- Venue:
- CRAC
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 31–37
- Language:
- URL:
- https://aclanthology.org/2022.crac-1.4
- DOI:
- Cite (ACL):
- Sharid Loáiciga, Simon Dobnik, and David Schlangen. 2022. Anaphoric Phenomena in Situated dialog: A First Round of Annotations. In Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, pages 31–37, Gyeongju, Republic of Korea. Association for Computational Linguistics.
- Cite (Informal):
- Anaphoric Phenomena in Situated dialog: A First Round of Annotations (Loáiciga et al., CRAC 2022)
- PDF:
- https://preview.aclanthology.org/starsem-semeval-split/2022.crac-1.4.pdf