Applying machine translation to two-stage cross-language information retrieval

Atsushi Fujii; Tetsuya Ishikawa

Applying machine translation to two-stage cross-language information retrieval

Abstract

Cross-language information retrieval (CLIR), where queries and documents are in different languages, needs a translation of queries and/or documents, so as to standardize both of them into a common representation. For this purpose, the use of machine translation is an effective approach. However, computational cost is prohibitive in translating large-scale document collections. To resolve this problem, we propose a two-stage CLIR method. First, we translate a given query into the document language, and retrieve a limited number of foreign documents. Second, we machine translate only those documents into the user language, and re-rank them based on the translation result. We also show the effectiveness of our method by way of experiments using Japanese queries and English technical documents.

Anthology ID:: 2000.amta-papers.2
Volume:: Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:: October 10-14
Year:: 2000
Address:: Cuernavaca, Mexico
Venue:: AMTA
SIG:
Publisher:: Springer
Note:
Pages:: 13–24
Language:
URL:: https://link.springer.com/chapter/10.1007/3-540-39965-8_2
DOI:
Bibkey:
Cite (ACL):: Atsushi Fujii and Tetsuya Ishikawa. 2000. Applying machine translation to two-stage cross-language information retrieval. In Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 13–24, Cuernavaca, Mexico. Springer.
Cite (Informal):: Applying machine translation to two-stage cross-language information retrieval (Fujii & Ishikawa, AMTA 2000)
Copy Citation:
PDF:: https://link.springer.com/chapter/10.1007/3-540-39965-8_2

PDF Search