Contextual term equivalent search using domain-driven disambiguation

Caroline Barrière, Pierre André Ménard, Daphnée Azoulay


Abstract
This article presents a domain-driven algorithm for the task of term sense disambiguation (TSD). TSD aims at automatically choosing which term record from a term bank best represents the meaning of a term occurring in a particular context. In a translation environment, finding the contextually appropriate term record is necessary to access the proper equivalent to be used in the target language text. The term bank TERMIUM Plus, recently published as an open access repository, is chosen as a domain-rich resource for testing our TSD algorithm, using English and French as source and target languages. We devise an experiment using over 1300 English terms found in scientific articles, and show that our domain-driven TSD algorithm is able to bring the best term record, and therefore the best French equivalent, at the average rank of 1.69 compared to a baseline random rank of 3.51.
Anthology ID:
W16-4704
Volume:
Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Patrick Drouin, Natalia Grabar, Thierry Hamon, Kyo Kageura, Koichi Takeuchi
Venue:
CompuTerm
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
21–29
Language:
URL:
https://aclanthology.org/W16-4704
DOI:
Bibkey:
Cite (ACL):
Caroline Barrière, Pierre André Ménard, and Daphnée Azoulay. 2016. Contextual term equivalent search using domain-driven disambiguation. In Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016), pages 21–29, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Contextual term equivalent search using domain-driven disambiguation (Barrière et al., CompuTerm 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-bitext-workshop/W16-4704.pdf