Verification-Aware Planning for Multi-Agent Systems

Tianyang Xu, Dan Zhang, Kushan Mitra, Estevam Hruschka


Abstract
Large language model (LLM) agents are increasingly deployed to tackle complex tasks, often necessitating collaboration among multiple specialized agents. However, multi-agent collaboration introduces new challenges in planning, coordination, and verification. Execution failures frequently arise not from flawed reasoning alone, but from subtle misalignments in task interpretation, output format, or inter-agent handoffs. To address these challenges, we present VeriMAP, a framework for multi-agent collaboration with verification-aware planning. The VeriMAP planner decomposes tasks, models subtask dependencies, and encodes planner-defined passing criteria as subtask verification functions (VFs) in Python and natural language. We evaluate VeriMAP on diverse datasets, demonstrating that it outperforms both single- and multi-agent baselines while enhancing system robustness and interpretability. Our analysis highlights how verification-aware planning enables reliable coordination and iterative refinement in multi-agent systems, without relying on external labels or annotations.
Anthology ID:
2026.eacl-long.353
Volume:
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7528–7546
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.353/
DOI:
Bibkey:
Cite (ACL):
Tianyang Xu, Dan Zhang, Kushan Mitra, and Estevam Hruschka. 2026. Verification-Aware Planning for Multi-Agent Systems. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7528–7546, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Verification-Aware Planning for Multi-Agent Systems (Xu et al., EACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.353.pdf