Bilingual Dictionary Induction as an Optimization Problem
Wushouer Mairidan, Toru Ishida, Donghui Lin, Katsutoshi Hirayama
Abstract
Bilingual dictionaries are vital in many areas of natural language processing, but such resources are rarely available for lower-density language pairs, especially for those that are closely related. Pivot-based induction consists of using a third language to bridge a language pair. As an approach to create new dictionaries, it can generate wrong translations due to polysemy and ambiguous words. In this paper we propose a constraint approach to pivot-based dictionary induction for the case of two closely related languages. In order to take into account the word senses, we use an approach based on semantic distances, in which possibly missing translations are considered, and instance of induction is encoded as an optimization problem to generate new dictionary. Evaluations show that the proposal achieves 83.7% accuracy and approximately 70.5% recall, thus outperforming the baseline pivot-based method.- Anthology ID:
- L14-1358
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2122–2129
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/417_Paper.pdf
- DOI:
- Cite (ACL):
- Wushouer Mairidan, Toru Ishida, Donghui Lin, and Katsutoshi Hirayama. 2014. Bilingual Dictionary Induction as an Optimization Problem. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2122–2129, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Bilingual Dictionary Induction as an Optimization Problem (Mairidan et al., LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/417_Paper.pdf