Shinya Iizuka
2025
DSLCMM: A Multimodal Human-Machine Dialogue Corpus Built through Competitions
Ryuichiro Higashinaka
|
Tetsuro Takahashi
|
Shinya Iizuka
|
Sota Horiuchi
|
Michimasa Inaba
|
Zhiyang Qi
|
Yuta Sasaki
|
Kotaro Funakoshi
|
Shoji Moriya
|
Shiki Sato
|
Takashi Minato
|
Kurima Sakai
|
Tomo Funayama
|
Masato Komuro
|
Hiroyuki Nishikawa
|
Ryosaku Makino
|
Hirofumi Kikuchi
|
Mayumi Usami
Proceedings of the 15th International Workshop on Spoken Dialogue Systems Technology
A corpus of dialogues between multimodal systems and humans is indispensable for the development and improvement of such systems. However, there is a shortage of human-machine multimodal dialogue datasets, which hinders the widespread deployment of these systems in society. To address this issue, we construct a Japanese multimodal human-machine dialogue corpus, DSLCMM, by collecting and organizing data from the Dialogue System Live Competitions (DSLCs). This paper details the procedure for constructing the corpus and presents our analysis of the relationship between various dialogue features and evaluation scores provided by users.
2024
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset
Atsumoto Ohashi
|
Ryu Hirai
|
Shinya Iizuka
|
Ryuichiro Higashinaka
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Dialogue datasets are crucial for deep learning-based task-oriented dialogue system research. While numerous English language multi-domain task-oriented dialogue datasets have been developed and contributed to significant advancements in task-oriented dialogue systems, such a dataset does not exist in Japanese, and research in this area is limited compared to that in English. In this study, towards the advancement of research and development of task-oriented dialogue systems in Japanese, we constructed JMultiWOZ, the first Japanese language large-scale multi-domain task-oriented dialogue dataset. Using JMultiWOZ, we evaluated the dialogue state tracking and response generation capabilities of the state-of-the-art methods on the existing major English benchmark dataset MultiWOZ2.2 and the latest large language model (LLM)-based methods. Our evaluation results demonstrated that JMultiWOZ provides a benchmark that is on par with MultiWOZ2.2. In addition, through evaluation experiments of interactive dialogues with the models and human participants, we identified limitations in the task completion capabilities of LLMs in Japanese.
Search
Fix data
Co-authors
- Ryuichiro Higashinaka 2
- Kotaro Funakoshi 1
- Tomo Funayama 1
- Ryu Hirai 1
- Sota Horiuchi 1
- show all...