NeuralFSM: Adaptive Multi-Agent Coordination via Learning Finite-State Execution Policy

Jiye Wang, Yu Wang, Jianbin Li, Shiduo Yang, Kenan Guo, Yuanhe Zhao


Abstract
LLM-powered multi-agent systems (MAS) have demonstrated strong performance on complex tasks. However, most existing approaches still rely on hand-crafted communication protocols or automatically designed communication topologies, which generalize poorly across tasks. We introduce NeuralFSM, a state-driven framework that formulates multi-agent problem solving as a finite-state execution process. NeuralFSM learns both the state transition distribution and inter-agent communication weights from interaction traces using a Temporal Coordination Controller. Rather than prioritizing explicit structure generation, the proposed framework uses task context to modulate transition and routing decisions, enabling flexible coordination without manual protocol design. To improve robustness against noisy or adversarial agents, we incorporate graph regularization during training and apply trust-aware message attenuation at runtime. Experiments on diverse benchmarks show that NeuralFSM consistently outperforms prior baselines by an average margin of 6.74%–19.39%, while substantially reducing token consumption. Moreover, NeuralFSM exhibits strong inherent robustness, which is further enhanced by the protection layer, resulting in only a 1.82% performance drop under attack.
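As a rough illustration of two ideas the abstract names, context-modulated state transitions and trust-aware message attenuation, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the dot-product scoring of transitions, and the multiply-then-renormalize attenuation rule are all assumptions made for this example, and the learned Temporal Coordination Controller is replaced by a fixed weight table.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def transition_distribution(state, context, weights):
    """Hypothetical transition model (an assumption, not from the paper).

    weights[state][next_state] is a vector with the same length as
    `context`; each candidate next state is scored by a dot product
    with the task-context vector, then scores are normalized.
    """
    logits = [
        sum(w * c for w, c in zip(row, context))
        for row in weights[state]
    ]
    return softmax(logits)


def attenuate(routing_weights, trust):
    """Hypothetical trust-aware attenuation (an assumption).

    Scales each sender's routing weight by a trust score in [0, 1],
    then renormalizes so the surviving weights sum to 1.
    """
    scaled = [w * t for w, t in zip(routing_weights, trust)]
    total = sum(scaled)
    if total == 0:
        return [0.0] * len(scaled)
    return [s / total for s in scaled]


# Two states, two-dimensional task context (toy values).
toy_weights = [
    [[1.0, 0.0], [0.0, 1.0]],  # transitions out of state 0
    [[0.0, 1.0], [1.0, 0.0]],  # transitions out of state 1
]
probs = transition_distribution(0, [1.0, 0.0], toy_weights)
routed = attenuate([0.5, 0.5], [1.0, 0.0])  # second sender is untrusted
```

In this toy setup the context vector favors staying in state 0, and the untrusted sender's messages are suppressed entirely, so all routing weight goes to the trusted sender.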
Anthology ID:
2026.acl-long.1543
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
33414–33436
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1543/
Cite (ACL):
Jiye Wang, Yu Wang, Jianbin Li, Shiduo Yang, Kenan Guo, and Yuanhe Zhao. 2026. NeuralFSM: Adaptive Multi-Agent Coordination via Learning Finite-State Execution Policy. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 33414–33436, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
NeuralFSM: Adaptive Multi-Agent Coordination via Learning Finite-State Execution Policy (Wang et al., ACL 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1543.pdf
Checklist:
 2026.acl-long.1543.checklist.pdf