Abstract
We describe a method for expanding existing dictionaries in several languages by discovering previously non-existent links between translations. We call this method triangulation and we present and compare several variations of it. We assess precision manually, and recall by comparing the extracted dictionaries with independently obtained basic vocabulary sets. We featurize the translation candidates and train a maximum entropy classifier to identify correct translations in the noisy data.- Anthology ID:
- L14-1669
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 1938–1942
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/864_Paper.pdf
- DOI:
- Cite (ACL):
- Judit Ács. 2014. Pivot-based multilingual dictionary building using Wiktionary. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 1938–1942, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Pivot-based multilingual dictionary building using Wiktionary (Ács, LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/864_Paper.pdf