User-Adaptive A Posteriori Restoration for Incorrectly Segmented Utterances in Spoken Dialogue Systems

Kazunori Komatani, Naoki Hotta, Satoshi Sato, Mikio Nakano


Abstract
Ideally, the users of spoken dialogue systems should be able to speak at their own tempo. Thus, the systems needs to interpret utterances from various users correctly, even when the utterances contain pauses. In response to this issue, we propose an approach based on a posteriori restoration for incorrectly segmented utterances. A crucial part of this approach is to determine whether restoration is required. We use a classification-based approach, adapted to each user. We focus on each user’s dialogue tempo, which can be obtained during the dialogue, and determine the correlation between each user’s tempo and the appropriate thresholds for classification. A linear regression function used to convert the tempos into thresholds is also derived. Experimental results show that the proposed user adaptation approach applied to two restoration classification methods, thresholding and decision trees, improves classification accuracies by 3.0% and 7.4%, respectively, in cross validation.
Anthology ID:
2017.dnd-8.2
Volume:
Dialogue Discourse Volume 8
Month:
Year:
2017
Address:
Editors:
Amanda Stent, Maite Taboada, Raquel Fernández, David Traum, Massimo Poesio, Barbara Di Eugenio, Manfred Stede
Venue:
DND
SIG:
SIGDIAL
Publisher:
Note:
Pages:
206–224
Language:
URL:
https://preview.aclanthology.org/ingest-dnd/2017.dnd-8.2/
DOI:
10.5087/dad.2017.209
Bibkey:
Cite (ACL):
Kazunori Komatani, Naoki Hotta, Satoshi Sato, and Mikio Nakano. 2017. User-Adaptive A Posteriori Restoration for Incorrectly Segmented Utterances in Spoken Dialogue Systems. Dialogue & Discourse, 8:206–224.
Cite (Informal):
User-Adaptive A Posteriori Restoration for Incorrectly Segmented Utterances in Spoken Dialogue Systems (Komatani et al., DND 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-dnd/2017.dnd-8.2.pdf