The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus

Akira Hayakawa, Saturnino Luz, Loredana Cerrato, Nick Campbell


Abstract
This paper presents the multimodal Interlingual Map Task Corpus (ILMT-s2s corpus) collected at Trinity College Dublin, and discuss some of the issues related to the collection and analysis of the data. The corpus design is inspired by the HCRC Map Task Corpus which was initially designed to support the investigation of linguistic phenomena, and has been the focus of a variety of studies of communicative behaviour. The simplicity of the task, and the complexity of phenomena it can elicit, make the map task an ideal object of study. Although there are studies that used replications of the map task to investigate communication in computer mediated tasks, this ILMT-s2s corpus is, to the best of our knowledge, the first investigation of communicative behaviour in the presence of three additional “filters”: Automatic Speech Recognition (ASR), Machine Translation (MT) and Text To Speech (TTS) synthesis, where the instruction giver and the instruction follower speak different languages. This paper details the data collection setup and completed annotation of the ILMT-s2s corpus, and outlines preliminary results obtained from the data.
Anthology ID:
L16-1096
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
605–612
Language:
URL:
https://aclanthology.org/L16-1096
DOI:
Bibkey:
Cite (ACL):
Akira Hayakawa, Saturnino Luz, Loredana Cerrato, and Nick Campbell. 2016. The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 605–612, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus (Hayakawa et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-1/L16-1096.pdf