Takayuki Ito
2026
Preference Estimation via Opponent Modeling in Multi-Agent Negotiation
Yuta Konishi | Kento Yamamoto | Eisuke Sonomoto | Rikuho Takeda | Ryo Furukawa | Yusuke Muraki | Takafumi Shimizu | Kazuma Fukumura | Yuya Kanemoto | Takayuki Ito | Shiyao Ding
Findings of the Association for Computational Linguistics: ACL 2026
Yuta Konishi | Kento Yamamoto | Eisuke Sonomoto | Rikuho Takeda | Ryo Furukawa | Yusuke Muraki | Takafumi Shimizu | Kazuma Fukumura | Yuya Kanemoto | Takayuki Ito | Shiyao Ding
Findings of the Association for Computational Linguistics: ACL 2026
Automated negotiation in complex, multi-party and multi-issue settings critically depends on accurate opponent modeling. However, conventional numerical-only approaches fail to capture the qualitative information embedded in natural language interactions, resulting in unstable and incomplete preference estimation. Although Large Language Models (LLMs) enable rich semantic understanding of utterances, it remains challenging to quantitatively incorporate such information into a consistent opponent modeling. To tackle this issue, we propose a novel preference estimation method integrating natural language information into a structured Bayesian opponent modeling framework. Our approach leverages LLMs to extract qualitative cues from utterances and converts them into probabilistic formats for dynamic belief tracking. Experimental results on a multi-party benchmark demonstrate that our framework improves the full agreement rate and preference estimation accuracy by integrating probabilistic reasoning with natural language understanding.
2025
The Hidden Strength of Disagreement: Unraveling the Consensus-Diversity Tradeoff in Adaptive Multi-Agent Systems
Zengqing Wu | Takayuki Ito
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Zengqing Wu | Takayuki Ito
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Consensus formation is pivotal in multi-agent systems (MAS), balancing collective coherence with individual diversity. Conventional LLM-based MAS primarily rely on explicit coordination, e.g., prompts or voting, risking premature homogenization. We argue that implicit consensus, where agents exchange information yet independently form decisions via in-context learning, can be more effective in dynamic environments that require long-horizon adaptability. By retaining partial diversity, systems can better explore novel strategies and cope with external shocks. We formalize a consensus-diversity tradeoff, showing conditions where implicit methods outperform explicit ones. Experiments on three scenarios – Dynamic Disaster Response, Information Spread and Manipulation, and Dynamic Public-Goods Provision – confirm partial deviation from group norms boosts exploration, robustness, and performance. We highlight emergent coordination via in-context learning, underscoring the value of preserving diversity for resilient decision-making.