Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs

Jing Yang Lee; Kong-Aik Lee; Woon-Seng Gan

Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs

Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

Abstract

Open-domain Dialogue (OD) exhibits a one-to-many (o2m) property, whereby multiple appropriate responses exist for a single dialogue context. Despite prior research showing that modeling this property boosts response diversity, most modern LLM-based dialogue agents do not explicitly do so. In this work, we model the o2m property of OD in LLMs by decomposing OD generation into two key tasks: Multi-Response Generation (MRG) and Preference-based Selection (PS), which entail generating a set of n semantically and lexically diverse high-quality responses for a given dialogue context, followed by selecting a single response based on human preference, respectively. To facilitate MRG and PS, we introduce o2mDial, a dialogue corpus explicitly designed to capture the o2m property by featuring multiple plausible responses for each context. Leveraging o2mDial, we propose new in-context learning and instruction-tuning strategies, as well as novel evaluation metrics for MRG, alongside a model-based approach for PS. Empirical results demonstrate that applying the proposed two-stage framework to smaller LLMs for OD generation enhances overall response diversity while maintaining contextual coherence, improving response quality by up to 90%, bringing them closer to the performance of larger models.

Anthology ID:: 2025.gem-1.24
Volume:: Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²)
Month:: July
Year:: 2025
Address:: Vienna, Austria and virtual meeting
Editors:: Ofir Arviv, Miruna Clinciu, Kaustubh Dhole, Rotem Dror, Sebastian Gehrmann, Eliya Habba, Itay Itzhak, Simon Mille, Yotam Perlitz, Enrico Santus, João Sedoc, Michal Shmueli Scheuer, Gabriel Stanovsky, Oyvind Tafjord
Venues:: GEM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 276–290
Language:
URL:: https://preview.aclanthology.org/nschneid-patch-1/2025.gem-1.24/
DOI:
Bibkey:
Cite (ACL):: Jing Yang Lee, Kong Aik Lee, and Woon-Seng Gan. 2025. Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs. In Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²), pages 276–290, Vienna, Austria and virtual meeting. Association for Computational Linguistics.
Cite (Informal):: Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs (Lee et al., GEM 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-1/2025.gem-1.24.pdf

PDF Cite Search Fix data