CORE: Cooperative Training of Retriever-Reranker for Effective Dialogue Response Selection

Chongyang Tao; Jiazhan Feng; Tao Shen; Chang Liu; Juntao Li; Xiubo Geng; Daxin Jiang

doi:10.18653/v1/2023.acl-long.174

CORE: Cooperative Training of Retriever-Reranker for Effective Dialogue Response Selection

Chongyang Tao, Jiazhan Feng, Tao Shen, Chang Liu, Juntao Li, Xiubo Geng, Daxin Jiang

Abstract

Establishing retrieval-based dialogue systems that can select appropriate responses from the pre-built index has gained increasing attention. Recent common practice is to construct a two-stage pipeline with a fast retriever (e.g., bi-encoder) for first-stage recall followed by a smart response reranker (e.g., cross-encoder) for precise ranking. However, existing studies either optimize the retriever and reranker in independent ways, or distill the knowledge from a pre-trained reranker into the retriever in an asynchronous way, leading to sub-optimal performance of both modules. Thus, an open question remains about how to train them for a better combination of the best of both worlds. To this end, we present a cooperative training of the response retriever and the reranker whose parameters are dynamically optimized by the ground-truth labels as well as list-wise supervision signals from each other. As a result, the two modules can learn from each other and evolve together throughout the training. Experimental results on two benchmarks demonstrate the superiority of our method.

Anthology ID:: 2023.acl-long.174
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3102–3114
Language:
URL:: https://aclanthology.org/2023.acl-long.174
DOI:: 10.18653/v1/2023.acl-long.174
Bibkey:
Cite (ACL):: Chongyang Tao, Jiazhan Feng, Tao Shen, Chang Liu, Juntao Li, Xiubo Geng, and Daxin Jiang. 2023. CORE: Cooperative Training of Retriever-Reranker for Effective Dialogue Response Selection. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3102–3114, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: CORE: Cooperative Training of Retriever-Reranker for Effective Dialogue Response Selection (Tao et al., ACL 2023)
Copy Citation:
PDF:: https://preview.aclanthology.org/emnlp-22-attachments/2023.acl-long.174.pdf

PDF Search