JEIDA’s Bilingual Corpus and Other Corpora for NLP Research in Japan

Hitoshi Isahara

Abstract

The committee on text processing technology of JEIDA (Japan Electronics Industry Development Association) has been developing a bilingual corpus for research on machine translation systems since the 1996 Japanese fiscal year. This paper presents an overview of that corpus and describes other linguistic data recently developed in Japan, including the RWC text database and the simple sentence data by the CRL and IPA.

Anthology ID: 1997.mtsummit-papers.11
Volume: Proceedings of Machine Translation Summit VI: Papers
Month: October 29 – November 1
Year: 1997
Address: San Diego, California
Venue: MTSummit
Pages: 142–147
URL: https://aclanthology.org/1997.mtsummit-papers.11
Cite (ACL): Hitoshi Isahara. 1997. JEIDA’s Bilingual Corpus and Other Corpora for NLP Research in Japan. In Proceedings of Machine Translation Summit VI: Papers, pages 142–147, San Diego, California.
Cite (Informal): JEIDA’s Bilingual Corpus and Other Corpora for NLP Research in Japan (Isahara, MTSummit 1997)
PDF: https://preview.aclanthology.org/ingestion-script-update/1997.mtsummit-papers.11.pdf
