Users and Data: The Two Neglected Children of Bilingual Natural Language Processing Research

Phillippe Langlais


Abstract
Despite numerous studies devoted to mining parallel material from bilingual data, we have yet to see the resulting technologies wholeheartedly adopted by professional translators and terminologists alike. I argue that this state of affairs is mainly due to two factors: the emphasis published authors put on models (even though data is as important), and the conspicuous lack of concern for actual end-users.
Anthology ID:
W17-2501
Volume:
Proceedings of the 10th Workshop on Building and Using Comparable Corpora
Month:
August
Year:
2017
Address:
Vancouver, Canada
Editors:
Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp
Venue:
BUCC
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–5
Language:
URL:
https://aclanthology.org/W17-2501
DOI:
10.18653/v1/W17-2501
Bibkey:
Cite (ACL):
Phillippe Langlais. 2017. Users and Data: The Two Neglected Children of Bilingual Natural Language Processing Research. In Proceedings of the 10th Workshop on Building and Using Comparable Corpora, pages 1–5, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
Users and Data: The Two Neglected Children of Bilingual Natural Language Processing Research (Langlais, BUCC 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-2/W17-2501.pdf