@inproceedings{afli-etal-2017-multinews,
    title = "{M}ulti{N}ews: A Web collection of an Aligned Multimodal and Multilingual Corpus",
    author = "Afli, Haithem  and
      Lohar, Pintu  and
      Way, Andy",
    editor = "Afli, Haithem  and
      Liu, Chao-Hong",
    booktitle = "Proceedings of the First Workshop on Curation and Applications of Parallel and Comparable Corpora",
    month = nov,
    year = "2017",
    address = "Taipei, Taiwan",
    publisher = "Asian Federation of Natural Language Processing",
    url = "https://aclanthology.org/W17-5602",
    pages = "11--15",
    abstract = "Integrating Natural Language Processing (NLP) and computer vision is a promising effort. However, the applicability of these methods directly depends on the availability of a specific multimodal data that includes images and texts. In this paper, we present a collection of a Multimodal corpus of comparable texts and their images in 9 languages from the web news articles of Euronews website. This corpus has found widespread use in the NLP community in Multilingual and multimodal tasks. Here, we focus on its acquisition of the images and text data and their multilingual alignment.",
}
Markdown (Informal)
[MultiNews: A Web collection of an Aligned Multimodal and Multilingual Corpus](https://aclanthology.org/W17-5602) (Afli et al., 2017)
ACL