Forecasting Conversation Derailments Through Generation

Yunfan Zhang, Kathleen McKeown, Smaranda Muresan


Abstract
Forecasting conversation derailment can be useful in real-world settings such as online content moderation, conflict resolution, and business negotiations. However, although language models succeed at identifying offensive speech already present in a conversation, they struggle to forecast future derailments. In contrast to prior work that predicts conversation outcomes solely from the past conversation history, our approach uses a fine-tuned LLM to sample multiple future conversation trajectories conditioned on the existing history, and predicts the conversation outcome from the consensus of these trajectories. We also experiment with using socio-linguistic attributes, which reflect turn-level conversation dynamics, as guidance when generating future conversations. Our trajectory-sampling method surpasses state-of-the-art results on English conversation derailment prediction benchmarks and demonstrates significant accuracy gains in ablation studies.
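The consensus step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `forecast_derailment`, the callables `sample_future` and `is_derailed`, the sample count, and the simple majority threshold are all assumptions made for illustration, since the abstract does not specify how consensus is computed.

```python
from typing import Callable, List

Conversation = List[str]  # one string per turn (assumed representation)


def forecast_derailment(
    history: Conversation,
    sample_future: Callable[[Conversation], Conversation],
    is_derailed: Callable[[Conversation], bool],
    num_samples: int = 5,
    threshold: float = 0.5,
) -> bool:
    """Forecast derailment by voting over sampled future trajectories.

    sample_future: hypothetical stand-in for a fine-tuned LLM that generates
        one possible continuation of the conversation.
    is_derailed: hypothetical stand-in for a check that labels a generated
        continuation as derailed or not.
    """
    # Draw several independent continuations of the same history.
    votes = [is_derailed(sample_future(history)) for _ in range(num_samples)]
    # Consensus here is a plain majority vote (an assumption of this sketch).
    derail_fraction = sum(votes) / num_samples
    return derail_fraction > threshold
```

In practice `sample_future` would wrap a stochastic decoding call to the generator, so that repeated calls on the same history yield different trajectories; with deterministic decoding every vote would be identical and the consensus would carry no extra information.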
Anthology ID:
2025.inlg-main.40
Volume:
Proceedings of the 18th International Natural Language Generation Conference
Month:
October
Year:
2025
Address:
Hanoi, Vietnam
Editors:
Lucie Flek, Shashi Narayan, Lê Hồng Phương, Jiahuan Pei
Venue:
INLG
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Pages:
699–715
URL:
https://preview.aclanthology.org/author-page-lei-gao-usc/2025.inlg-main.40/
Cite (ACL):
Yunfan Zhang, Kathleen McKeown, and Smaranda Muresan. 2025. Forecasting Conversation Derailments Through Generation. In Proceedings of the 18th International Natural Language Generation Conference, pages 699–715, Hanoi, Vietnam. Association for Computational Linguistics.
Cite (Informal):
Forecasting Conversation Derailments Through Generation (Zhang et al., INLG 2025)
PDF:
https://preview.aclanthology.org/author-page-lei-gao-usc/2025.inlg-main.40.pdf