Experiences in Resource Generation for Machine Translation through Crowdsourcing

Anoop Kunchukuttan, Shourya Roy, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra, Pushpak Bhattacharyya


Abstract
The logistics of collecting resources for Machine Translation (MT) has always been a cause of concern for some of the resource deprived languages of the world. The recent advent of crowdsourcing platforms provides an opportunity to explore the large scale generation of resources for MT. However, before venturing into this mode of resource collection, it is important to understand the various factors such as, task design, crowd motivation, quality control, etc. which can influence the success of such a crowd sourcing venture. In this paper, we present our experiences based on a series of experiments performed. This is an attempt to provide a holistic view of the different facets of translation crowd sourcing and identifying key challenges which need to be addressed for building a practical crowdsourcing solution for MT.
Anthology ID:
L12-1127
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
384–391
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/292_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Anoop Kunchukuttan, Shourya Roy, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra, and Pushpak Bhattacharyya. 2012. Experiences in Resource Generation for Machine Translation through Crowdsourcing. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 384–391, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Experiences in Resource Generation for Machine Translation through Crowdsourcing (Kunchukuttan et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/292_Paper.pdf