A Corpus for Personalized Dialogue Breakdown Repair in Japanese Open-Domain Conversations

Kazuya Tsubokura, Yurie Iribe, Norihide Kitaoka


Abstract
Recent advances in dialogue systems have been remarkable; however, conversational breakdowns still occur, making it essential to develop appropriate repair strategies. Nevertheless, when a system breakdown actually occurs, it remains unclear how the system should perform the repair, and no corpus has been available to investigate this issue. To address this gap, we presented typical examples of system-induced dialogue breakdowns to crowd workers and collected their expected repair utterances toward the broken system. Each repair utterance was annotated with dialogue act tags, and we constructed a breakdown-repair corpus consisting of 3,990 utterances covering ten representative types of breakdowns. This corpus includes breakdown cases across diverse situations, allowing for the examination of various repair patterns. Furthermore, we also conducted a questionnaire on participants’ personal traits, creating a dataset that enables the investigation of repair strategies tailored to individual user characteristics. In this paper, we report an overview of the dataset and preliminary analysis results.
Anthology ID:
2026.lrec-main.227
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
2899–2912
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.227/
DOI:
Bibkey:
Cite (ACL):
Kazuya Tsubokura, Yurie Iribe, and Norihide Kitaoka. 2026. A Corpus for Personalized Dialogue Breakdown Repair in Japanese Open-Domain Conversations. International Conference on Language Resources and Evaluation, main:2899–2912.
Cite (Informal):
A Corpus for Personalized Dialogue Breakdown Repair in Japanese Open-Domain Conversations (Tsubokura et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.227.pdf