Abstract
Image description datasets, such as Flickr30K and MS COCO, show a high degree of variation in the ways that crowd-workers talk about the world. Although this gives us a rich and diverse collection of data to work with, it also introduces uncertainty about how the world should be described. This paper shows the extent of this uncertainty in the PEOPLE-domain. We present a taxonomy of different ways to talk about other people. This taxonomy serves as a reference point to think about how other people should be described, and can be used to classify and compute statistics about labels applied to people.- Anthology ID:
- W18-6550
- Volume:
- Proceedings of the 11th International Conference on Natural Language Generation
- Month:
- November
- Year:
- 2018
- Address:
- Tilburg University, The Netherlands
- Editors:
- Emiel Krahmer, Albert Gatt, Martijn Goudbeek
- Venue:
- INLG
- SIG:
- SIGGEN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 415–420
- Language:
- URL:
- https://aclanthology.org/W18-6550
- DOI:
- 10.18653/v1/W18-6550
- Cite (ACL):
- Emiel van Miltenburg, Desmond Elliott, and Piek Vossen. 2018. Talking about other people: an endless range of possibilities. In Proceedings of the 11th International Conference on Natural Language Generation, pages 415–420, Tilburg University, The Netherlands. Association for Computational Linguistics.
- Cite (Informal):
- Talking about other people: an endless range of possibilities (van Miltenburg et al., INLG 2018)
- PDF:
- https://preview.aclanthology.org/emnlp22-frontmatter/W18-6550.pdf
- Code
- evanmiltenburg/LabelingPeople
- Data
- Flickr30k, MS COCO