Abstract
Text in social media posts is frequently accompanied by images in order to provide content, supply context, or to express feelings. This paper studies how the meaning of the entire tweet is composed through the relationship between its textual content and its image. We build and release a data set of image tweets annotated with four classes which express whether the text or the image provides additional information to the other modality. We show that by combining the text and image information, we can build a machine learning approach that accurately distinguishes between the relationship types. Further, we derive insights into how these relationships are materialized through text and image content analysis and how they are impacted by user demographic traits. These methods can be used in several downstream applications including pre-training image tagging models, collecting distantly supervised data for image captioning, and can be directly used in end-user applications to optimize screen estate.- Anthology ID:
- P19-1272
- Volume:
- Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
- Month:
- July
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Anna Korhonen, David Traum, Lluís Màrquez
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2830–2840
- Language:
- URL:
- https://aclanthology.org/P19-1272
- DOI:
- 10.18653/v1/P19-1272
- Cite (ACL):
- Alakananda Vempala and Daniel Preoţiuc-Pietro. 2019. Categorizing and Inferring the Relationship between the Text and Image of Twitter Posts. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2830–2840, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Categorizing and Inferring the Relationship between the Text and Image of Twitter Posts (Vempala & Preoţiuc-Pietro, ACL 2019)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/P19-1272.pdf
- Data
- ImageNet