Building Domain Specific Bilingual Dictionaries
Lucas Hilgert, Lucelene Lopes, Artur Freitas, Renata Vieira, Denise Hogetop, Aline Vanin
Abstract
This paper proposes a method to build bilingual dictionaries for specific domains defined by a parallel corpora. The proposed method is based on an original method that is not domain specific. Both the original and the proposed methods are constructed with previously available natural language processing tools. Therefore, this paper contribution resides in the choice and parametrization of the chosen tools. To illustrate the proposed method benefits we conduct an experiment over technical manuals in English and Portuguese. The results of our proposed method were analyzed by human specialists and our results indicates significant increases in precision for unigrams and muli-grams. Numerically, the precision increase is as big as 15% according to our evaluation.- Anthology ID:
- L14-1084
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2772–2777
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/1112_Paper.pdf
- DOI:
- Cite (ACL):
- Lucas Hilgert, Lucelene Lopes, Artur Freitas, Renata Vieira, Denise Hogetop, and Aline Vanin. 2014. Building Domain Specific Bilingual Dictionaries. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2772–2777, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Building Domain Specific Bilingual Dictionaries (Hilgert et al., LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/1112_Paper.pdf