Data Selection for Multi-turn Dialogue Instruction Tuning

Bo Li; Shikun Zhang; Wei Ye

Data Selection for Multi-turn Dialogue Instruction Tuning

Abstract

Instruction-tuned language models increasingly rely on large multi-turn dialogue corpora, but these datasets are often noisy and structurally inconsistent, with topic drift, repetitive chitchat, and mismatched answer formats across turns. We address this from a data selection perspective and propose MDS (Multi-turn Dialogue Selection), a dialogue-level framework that scores whole conversations rather than isolated turns. MDS combines a global coverage stage that performs bin-wise selection in the user-query trajectory space to retain representative yet non-redundant dialogues, with a local structural stage that evaluates within-dialogue reliability through entity-grounded topic grounding and information progress, together with query-answer form consistency for functional alignment. MDS outperforms strong single-turn selectors, dialogue-level LLM scorers, and heuristic baselines on three multi-turn benchmarks and an in-domain Banking test set, achieving the best overall rank across reference-free and reference-based metrics, and is more robust on long conversations under the same training budget. Code and resources are included in the supplementary materials.

Anthology ID:: 2026.findings-acl.130
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2724–2739
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.130/
DOI:
Bibkey:
Cite (ACL):: Bo Li, Shikun Zhang, and Wei Ye. 2026. Data Selection for Multi-turn Dialogue Instruction Tuning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 2724–2739, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Data Selection for Multi-turn Dialogue Instruction Tuning (Li et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.130.pdf
Checklist:: 2026.findings-acl.130.checklist.pdf

PDF Cite Search Checklist Fix data