A Two-Stage Progressive Intent Clustering for Task-Oriented Dialogue

Bingzhu Du, Nan Su, Yuchi Zhang, Yongliang Wang


Abstract
Natural Language Understanding (NLU) is one of the most critical components of task-oriented dialogue, and it is often considered as an intent classification task. To achieve outstanding intent identification performance, system designers often need to hire a large number of domain experts to label the data, which is inefficient and costly. To address this problem, researchers’ attention has gradually shifted to automatic intent clustering methods, which employ low-resource unsupervised approaches to solve classification problems. The classical framework for clustering is deep clustering, which uses deep neural networks (DNNs) to jointly optimize non-clustering loss and clustering loss. However, for new conversational domains or services, utterances required to assign intents are scarce and the performance of DNNs is often dependent on large amounts of data. In addition, although re-clustering with k-means algorithm after training the network usually leads to better results, k-means methods often suffer from poor stability. To address these problems, we propose an effective two-stage progressive approach to refine the clustering. Firstly, we pre-train the network with contrastive loss using all conversations data and then optimize the clustering loss and contrastive loss simultaneously. Secondly, we propose adaptive progressive k-means to alleviate the randomness of vanilla k-means, achieving better performance and smaller deviation. Our method ranks second in DSTC11 Track2 Task 1, a benchmark for intent clustering of task-oriented dialogue, demonstrating the superiority and effectiveness of our method.
Anthology ID:
2023.dstc-1.7
Volume:
Proceedings of The Eleventh Dialog System Technology Challenge
Month:
September
Year:
2023
Address:
Prague, Czech Republic
Editors:
Yun-Nung Chen, Paul Crook, Michel Galley, Sarik Ghazarian, Chulaka Gunasekara, Raghav Gupta, Behnam Hedayatnia, Satwik Kottur, Seungwhan Moon, Chen Zhang
Venues:
DSTC | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
48–56
Language:
URL:
https://aclanthology.org/2023.dstc-1.7
DOI:
Bibkey:
Cite (ACL):
Bingzhu Du, Nan Su, Yuchi Zhang, and Yongliang Wang. 2023. A Two-Stage Progressive Intent Clustering for Task-Oriented Dialogue. In Proceedings of The Eleventh Dialog System Technology Challenge, pages 48–56, Prague, Czech Republic. Association for Computational Linguistics.
Cite (Informal):
A Two-Stage Progressive Intent Clustering for Task-Oriented Dialogue (Du et al., DSTC-WS 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2023.dstc-1.7.pdf