Shannan Liu
2026
DraDDP: A Multimodal Multi-Party Dialogue Discourse Parsing Dataset
Shannan Liu | Peifeng Li | Yaxin Fan | Qiaoming Zhu
Findings of the Association for Computational Linguistics: ACL 2026
Shannan Liu | Peifeng Li | Yaxin Fan | Qiaoming Zhu
Findings of the Association for Computational Linguistics: ACL 2026
Multi-party dialogue discourse parsing aims to identify dependency structures and relation types between utterances in conversations. Previous studies are mostly limited to textual modality or two-party dialogue, failing to meet the multimodal and multi-party settings. In this paper, we construct the first publicly available English multimodal dataset DraDDP for multi-party dialogue discourse parsing, based on American TV dramas. DraDDP contains 495 dialogue segments with 6,374 utterances and 9.1 hours of parallel video content, covering rich multi-party interaction scenarios. Moreover, we establish comprehensive benchmarks by evaluating this task on DraDDP and conducting in-depth analysis on the impact of different modalities. Experimental results demonstrate the value of multimodal information in capturing dialogue structures and relation types. We will publicly release the dataset, annotation guidelines, and code to promote future research in multimodal dialogue understanding.
2025
Enhancing Multi-party Dialogue Discourse Parsing with Explanation Generation
Shannan Liu | Peifeng Li | Yaxin Fan | Qiaoming Zhu
Proceedings of the 31st International Conference on Computational Linguistics
Shannan Liu | Peifeng Li | Yaxin Fan | Qiaoming Zhu
Proceedings of the 31st International Conference on Computational Linguistics
Multi-party dialogue discourse parsing is an important and challenging task in natural language processing (NLP). Previous studies struggled to fully understand the deep semantics of dialogues, especially when dealing with complex topic interleaving and ellipsis. To address the above issues, we propose a novel model DDPE (Dialogue Discourse Parsing with Explanations) to integrate external knowledge from Large Language Models (LLMs), which consists of three components, i.e., explanation generation, structural parsing, and contrastive learning. DDPE employs LLMs to generate explanatory and contrastive information about discourse structure, thereby providing additional reasoning cues that enhance the understanding of dialogue semantics. The experimental results on the two public datasets STAC and Molweni show that our DDPE significantly outperforms the state-of-the-art (SOTA) baselines.