Nobushige Doi
2020
TDDC: Timely Disclosure Documents Corpus
Nobushige Doi
|
Yusuke Oda
|
Toshiaki Nakazawa
Proceedings of the Twelfth Language Resources and Evaluation Conference
In this paper, we describe the details of the Timely Disclosure Documents Corpus (TDDC). TDDC was prepared by manually aligning the sentences from past Japanese and English timely disclosure documents in PDF format published by companies listed on the Tokyo Stock Exchange. TDDC consists of approximately 1.4 million parallel sentences in Japanese and English. TDDC was used as the official dataset for the 6th Workshop on Asian Translation to encourage the development of machine translation.
2019
Proceedings of the 6th Workshop on Asian Translation
Toshiaki Nakazawa
|
Chenchen Ding
|
Raj Dabre
|
Anoop Kunchukuttan
|
Nobushige Doi
|
Yusuke Oda
|
Ondřej Bojar
|
Shantipriya Parida
|
Isao Goto
|
Hidaya Mino
Proceedings of the 6th Workshop on Asian Translation
Overview of the 6th Workshop on Asian Translation
Toshiaki Nakazawa
|
Nobushige Doi
|
Shohei Higashiyama
|
Chenchen Ding
|
Raj Dabre
|
Hideya Mino
|
Isao Goto
|
Win Pa Pa
|
Anoop Kunchukuttan
|
Yusuke Oda
|
Shantipriya Parida
|
Ondřej Bojar
|
Sadao Kurohashi
Proceedings of the 6th Workshop on Asian Translation
This paper presents the results of the shared tasks from the 6th workshop on Asian translation (WAT2019) including Ja↔En, Ja↔Zh scientific paper translation subtasks, Ja↔En, Ja↔Ko, Ja↔En patent translation subtasks, Hi↔En, My↔En, Km↔En, Ta↔En mixed domain subtasks and Ru↔Ja news commentary translation task. For the WAT2019, 25 teams participated in the shared tasks. We also received 10 research paper submissions out of which 61 were accepted. About 400 translation results were submitted to the automatic evaluation server, and selected submis- sions were manually evaluated.
Search
Co-authors
- Yusuke Oda 3
- Toshiaki Nakazawa 3
- Chenchen Ding 2
- Raj Dabre 2
- Anoop Kunchukuttan 2
- show all...