Abstract
Most MT lexicography is devoted to developing rules of the kind, “in context C, translate source-language word S as target-language word T”. Very many such rules are required, producing them is laborious, and MT companies standardly spend large sums on it. We present the WASP-Bench, a lexicographer's workstation for the rapid and semi-automatic development of such rule-sets. The WASP-Bench makes use of a large source-language corpus and state-of-the-art techniques for Word Sense Disambiguation. We show that the WSD accuracy is on a par with the best results published to date, with the advantage that the WASP-Bench, unlike other high- performance systems, does not require a sense-disambiguated training corpus as input. The WASP-Bench is designed to fit readily with MT companies' working practices, as it may be used for as many or as few source language words as present disambiguation problems for a given target.- Anthology ID:
- 2001.mtsummit-papers.34
- Volume:
- Proceedings of Machine Translation Summit VIII
- Month:
- September 18-22
- Year:
- 2001
- Address:
- Santiago de Compostela, Spain
- Editor:
- Bente Maegaard
- Venue:
- MTSummit
- SIG:
- Publisher:
- Note:
- Pages:
- Language:
- URL:
- https://aclanthology.org/2001.mtsummit-papers.34
- DOI:
- Cite (ACL):
- Adam Kilgarriff and David Tugwell. 2001. WASP-Bench: an MT lexicographers’ workstation supporting state-of-the-art lexical disambiguation. In Proceedings of Machine Translation Summit VIII, Santiago de Compostela, Spain.
- Cite (Informal):
- WASP-Bench: an MT lexicographers’ workstation supporting state-of-the-art lexical disambiguation (Kilgarriff & Tugwell, MTSummit 2001)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2001.mtsummit-papers.34.pdf