2014
pdf
abs
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
Anabela Barreiro
|
Johanna Monti
|
Brigitte Orliac
|
Susanne Preuß
|
Kutz Arrieta
|
Wang Ling
|
Fernando Batista
|
Isabel Trancoso
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish. We classify support verb constructions by means of their syntactic structure and semantic behavior and present a qualitative analysis of their translation errors. The study aims to verify how machine translation (MT) systems translate fine-grained linguistic phenomena, and how well-equipped they are to produce high-quality translation. Another goal of the linguistically motivated quality analysis of SVC raw output is to reinforce the need for better system hybridization, which leverages the strengths of RBMT to the benefit of SMT, especially in improving the translation of multiword units. Taking multiword units into account, we propose an effective method to achieve MT hybridization based on the integration of semantico-syntactic knowledge into SMT.
2013
pdf
When multiwords go bad in machine translation
Anabela Barreiro
|
Johanna Monti
|
Brigitte Orliac
|
Fernando Batista
Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technologies
2003
pdf
abs
Collocation extraction for machine translation
Brigitte Orliac
|
Mike Dillinger
Proceedings of Machine Translation Summit IX: Papers
This paper reports on the development of a collocation extraction system that is designed within a commercial machine translation system in order to take advantage of the robust syntactic analysis that the system offers and to use this analysis to refine collocation extraction. Embedding the extraction system also addresses the need to provide information about the source language collocations in a system-specific form to support automatic generation of a collocation rulebase for analysis and translation.
1999
pdf
abs
Term Builder: a lexical knowledge acquisition tool for the Logos machine translation system
Brigitte Orliac
|
Kutz Arrieta
Proceedings of Machine Translation Summit VII
Logos 8, the next generation of the Logos Machine Translation (MT) system, is a client server application, which realizes the latest advances in system design and architecture. A multi-user, networkable application, Logos 8 allows Internet or Intranet use of its applications with client interfaces that communicate with dictionaries and translation servers through a common gateway. The new Logos 8 technology is based on a relational database for storage and organization of the lexical data. In this paper, we present Term-Builder, the Lexical Knowledge Acquisition tool developed for Logos 8. The new automatic coding functionality within Term-Builder is significantly improving the process of acquiring new lexicons for MT and other applications.
1998
pdf
abs
Logos8 system description
Brigitte Orliac
Proceedings of the Third Conference of the Association for Machine Translation in the Americas: System Descriptions
The globalization of the information exchange made possible by the Internet and the World Wide Web has led to an increasing demand for translation and other language-enabled tools and services. Developers of Machine Translation (MT) systems are best positioned to address the international community ever growing need for information processing technologies. Today Logos offers its MT technology in a relational model on NT and Unix servers with net-centric Java clients. The new model realized in Logos8 is also preparing the system for use on the Internet as an information-gathering utility. This paper describes the new Logos8 system and presents the product developments made possible by the new system.