From Individual Excellence to Collective Sustainability: Seeking Strategic Equilibrium in Proactive Multi-Agent Teams

Tong Zhang, Yang Wu, Yufei Shi, Rujing Yao, Zhuoren Jiang, Xiaozhong Liu


Abstract
In heterogeneous scientific teams, proactive team agents can serve as effective assistants regarding the research progress of the project. However, proactive agents always suffer from collaborative myopia: a greedy optimization for immediate task accuracy which ignore the long-term goal of team sustainability. This leads to the Individual-centric Trap, where capable experts (e.g., PIs) are disproportionately overloaded while Junior roles remain underutilized. Therefore, neglecting opportunity costs in task allocation can implicitly erodes the enduring performance of the team. To solve this imbalance between efficiency and sustainability, we propose GT-PMARL (Game-Theoretic Proactive Multi-Agent Reinforcement Learning). By internalizing the opportunity cost as a key consideration in individual decision-making, the collaboration logic of agents has been reshaped. Our framework employs: (1) a Positive-Unlabeled scorer to anchor intervention quality under sparse supervision; (2) a Nash-Pareto competitive objective to seek an equilibrium between individual task excellence and collective load balancing. Empirical experiments in scientific workflows show that GT-PMARL effectively maintains high performance while preventing experts from over-developing. Our work provides a scalable paradigm for building a sustainable and balanced human-AI collaborative ecosystem.
Anthology ID:
2026.findings-acl.1920
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
38552–38565
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1920/
DOI:
Bibkey:
Cite (ACL):
Tong Zhang, Yang Wu, Yufei Shi, Rujing Yao, Zhuoren Jiang, and Xiaozhong Liu. 2026. From Individual Excellence to Collective Sustainability: Seeking Strategic Equilibrium in Proactive Multi-Agent Teams. In Findings of the Association for Computational Linguistics: ACL 2026, pages 38552–38565, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
From Individual Excellence to Collective Sustainability: Seeking Strategic Equilibrium in Proactive Multi-Agent Teams (Zhang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1920.pdf
Checklist:
 2026.findings-acl.1920.checklist.pdf