The texts in flickr5000.clus are a subset of the captions in the Flickr 8k dataset described in 
M. Hodosh, P. Young and J. Hockenmaier (2013) "Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics", Journal of Artificial Intelligence Research, Volume 47, pages 853-899 
and available from http://nlp.cs.illinois.edu/HockenmaierGroup/8k-pictures.html. As with the original Flickr8k, flickr5000.clus is released under a CreativeCommons Attribution-ShareAlike license, http://creativecommons.org/licenses/by/3.0/


The texts in pascalByWord.clus are the captions in the PASCAL dataset described in 
Cyrus Rashtchian, Peter Young, Micah Hodosh, and Julia Hockenmaier. Collecting Image Annotations Using Amazon's Mechanical Turk. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk
and available from http://nlp.cs.illinois.edu/HockenmaierGroup/pascal-sentences/index.html. As with the original PASCAL dataset, pascalByWord.clus is released under a CreativeCommons Attribution-ShareAlike license, http://creativecommons.org/licenses/by/3.0/
