Norihito Yasuda
Also published as: Norihi Yasuda
2024
JaParaPat: A Large-Scale Japanese-English Parallel Patent Application Corpus
Masaaki Nagata | Makoto Morishita | Katsuki Chousa | Norihito Yasuda
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Masaaki Nagata | Makoto Morishita | Katsuki Chousa | Norihito Yasuda
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
We constructed JaParaPat (Japanese-English Parallel Patent Application Corpus), a bilingual corpus of more than 300 million Japanese-English sentence pairs from patent applications published in Japan and the United States from 2000 to 2021. We obtained the publication of unexamined patent applications from the Japan Patent Office (JPO) and the United States Patent and Trademark Office (USPTO). We also obtained patent family information from the DOCDB, that is a bibliographic database maintained by the European Patent Office (EPO). We extracted approximately 1.4M Japanese-English document pairs, which are translations of each other based on the patent families, and extracted about 350M sentence pairs from the document pairs using a translation-based sentence alignment method whose initial translation model is bootstrapped from a dictionary-based sentence alignment. We experimentally improved the accuracy of the patent translations by 20 bleu points by adding more than 300M sentence pairs obtained from patent applications to 22M sentence pairs obtained from the web.
2015
A Dynamic Programming Algorithm for Tree Trimming-based Text Summarization
Masaaki Nishino | Norihito Yasuda | Tsutomu Hirao | Shin-ichi Minato | Masaaki Nagata
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Masaaki Nishino | Norihito Yasuda | Tsutomu Hirao | Shin-ichi Minato | Masaaki Nagata
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
2013
Single-Document Summarization as a Tree Knapsack Problem
Tsutomu Hirao | Yasuhisa Yoshida | Masaaki Nishino | Norihito Yasuda | Masaaki Nagata
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
Tsutomu Hirao | Yasuhisa Yoshida | Masaaki Nishino | Norihito Yasuda | Masaaki Nagata
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
2008
Test Collections for Spoken Document Retrieval from Lecture Audio Data
Tomoyosi Akiba | Kiyoaki Aikawa | Yoshiaki Itoh | Tatsuya Kawahara | Hiroaki Nanjo | Hiromitsu Nishizaki | Norihito Yasuda | Yoichi Yamashita | Katunobu Itou
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Tomoyosi Akiba | Kiyoaki Aikawa | Yoshiaki Itoh | Tatsuya Kawahara | Hiroaki Nanjo | Hiromitsu Nishizaki | Norihito Yasuda | Yoichi Yamashita | Katunobu Itou
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
The Spoken Document Processing Working Group, which is part of the special interest group of spoken language processing of the Information Processing Society of Japan, is developing a test collection for evaluation of spoken document retrieval systems. A prototype of the test collection consists of a set of textual queries, relevant segment lists, and transcriptions by an automatic speech recognition system, allowing retrieval from the Corpus of Spontaneous Japanese (CSJ). From about 100 initial queries, application of the criteria that a query should have more than five relevant segments that consist of about one minute speech segments yielded 39 queries. Targeting the test collection, an ad hoc retrieval experiment was also conducted to assess the baseline retrieval performance by applying a standard method for spoken document retrieval.
2007
Japanese Dependency Parsing Using Sequential Labeling for Semi-spoken Language
Kenji Imamura | Genichiro Kikui | Norihito Yasuda
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions
Kenji Imamura | Genichiro Kikui | Norihito Yasuda
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions
2005
Does this Answer your Question? Towards Dialogue Management for Restricted Domain Question Answering Systems
Matthias Denecke | Norihito Yasuda
Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue
Matthias Denecke | Norihito Yasuda
Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue
2001
Spoken Dialogue Control Based on a Turn-minimization Criterion Depending on the Speech Recognition Accuracy
Norihi Yasuda | Kohji Dohsaka | Kiyoaki Aikawa
Proceedings of the Second SIGdial Workshop on Discourse and Dialogue
Norihi Yasuda | Kohji Dohsaka | Kiyoaki Aikawa
Proceedings of the Second SIGdial Workshop on Discourse and Dialogue
2000
Search
Fix author
Co-authors
- Kiyoaki Aikawa 3
- Masaaki Nagata 3
- Kohji Dohsaka 2
- Tsutomu Hirao 2
- Masaaki Nishino 2
- Tomoyosi Akiba 1
- Katsuki Chousa 1
- Matthias Denecke 1
- Jun-ichi Hirasawa 1
- Kenji Imamura 1
- Yoshiaki Itoh 1
- Katunobu Itou 1
- Tatsuya Kawahara 1
- Gen-ichiro Kikui 1
- Shin-ichi Minato 1
- Noboru Miyazaki 1
- Makoto Morishita 1
- Mikio Nakano 1
- Hiroaki Nanjo 1
- Hiromitsu Nishizaki 1
- Akira Sugiyama 1
- Yoichi Yamashita 1
- Yasuhisa Yoshida 1