ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents

Tianjian Liu; Fanqi Wan; Jiajian Guo; Xiaojun Quan

ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents

Tianjian Liu, Fanqi Wan, Jiajian Guo, Xiaojun Quan

Abstract

Proactive dialogue has emerged as a critical and challenging research problem in advancing large language models (LLMs). Existing works predominantly focus on domain-specific or task-oriented scenarios, which leads to fragmented evaluations and limits the comprehensive exploration of models’ proactive dialogue abilities. In this work, we propose ProactiveEval, a unified framework for evaluating proactive dialogue capabilities of LLMs. This framework decomposes proactive dialogue into target planning and dialogue guidance, establishing evaluation metrics across various domains. Moreover, it also enables the automatic generation of diverse and challenging evaluation data. Based on the proposed framework, we develop 328 evaluation environments spanning 6 distinct domains. Through experiments with 22 different types of LLMs, we show that DeepSeek-R1 and Claude-3.7-Sonnet exhibit exceptional performance on target planning and dialogue guidance tasks, respectively. Finally, we investigate how reasoning capabilities influence proactive behaviors and discuss their implications for future model development. Our code and data are available at the https://github.com/liutj9/ProactiveEval.

Anthology ID:: 2026.acl-long.1906
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 41068–41100
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1906/
DOI:
Bibkey:
Cite (ACL):: Tianjian Liu, Fanqi Wan, Jiajian Guo, and Xiaojun Quan. 2026. ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 41068–41100, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents (Liu et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1906.pdf
Checklist:: 2026.acl-long.1906.checklist.pdf

PDF Cite Search Checklist Fix data