Abstract
The committee on text processing technology of JEIDA (Japan Electronics Industry Development Association) has been developing a bilingual corpus for research on machine translation systems since the 1996 Japanese fiscal year. This paper presents an overview of that bilingual corpus. It also describes other linguistic data recently developed in Japan, including the RWC text database and the simple sentence data by the CRL and IPA.
- Anthology ID:
- 1997.mtsummit-papers.11
- Volume:
- Proceedings of Machine Translation Summit VI: Papers
- Month:
- October 29 – November 1
- Year:
- 1997
- Address:
- San Diego, California
- Venue:
- MTSummit
- Pages:
- 142–147
- URL:
- https://aclanthology.org/1997.mtsummit-papers.11
- Cite (ACL):
- Hitoshi Isahara. 1997. JEIDA’s Bilingual Corpus and Other Corpora for NLP Research in Japan. In Proceedings of Machine Translation Summit VI: Papers, pages 142–147, San Diego, California.
- Cite (Informal):
- JEIDA’s Bilingual Corpus and Other Corpora for NLP Research in Japan (Isahara, MTSummit 1997)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/1997.mtsummit-papers.11.pdf