MuTual: A Dataset for Multi-Turn Dialogue Reasoning

Leyang Cui; Yu Wu; Shujie Liu; Yue Zhang; Ming Zhou

doi:10.18653/v1/2020.acl-main.130

MuTual: A Dataset for Multi-Turn Dialogue Reasoning

Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang, Ming Zhou

Abstract

Non-task oriented dialogue systems have achieved great success in recent years due to largely accessible conversation data and the development of deep learning techniques. Given a context, current systems are able to yield a relevant and fluent response, but sometimes make logical mistakes because of weak reasoning capabilities. To facilitate the conversation reasoning research, we introduce MuTual, a novel dataset for Multi-Turn dialogue Reasoning, consisting of 8,860 manually annotated dialogues based on Chinese student English listening comprehension exams. Compared to previous benchmarks for non-task oriented dialogue systems, MuTual is much more challenging since it requires a model that be able to handle various reasoning problems. Empirical results show that state-of-the-art methods only reach 71%, which is far behind human performance of 94%, indicating that there is ample room for improving reasoning ability.

Anthology ID:: 2020.acl-main.130
Volume:: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:: July
Year:: 2020
Address:: Online
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1406–1416
Language:
URL:: https://aclanthology.org/2020.acl-main.130
DOI:: 10.18653/v1/2020.acl-main.130
Bibkey:
Cite (ACL):: Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang, and Ming Zhou. 2020. MuTual: A Dataset for Multi-Turn Dialogue Reasoning. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1406–1416, Online. Association for Computational Linguistics.
Cite (Informal):: MuTual: A Dataset for Multi-Turn Dialogue Reasoning (Cui et al., ACL 2020)
Copy Citation:
PDF:: https://preview.aclanthology.org/starsem-semeval-split/2020.acl-main.130.pdf
Dataset:: 2020.acl-main.130.Dataset.zip
Video:: http://slideslive.com/38928987
Code: Nealcly/MuTual
Data: MuTual, CoQA, CommonsenseQA, DREAM, DROP, Douban, RACE, SWAG, WSC

PDF Search Code Dataset Video