ProToM: Promoting Prosocial Behaviour via Theory of Mind-Informed Feedback

Matteo Bortoletto, Yichao Zhou, Lance Ying, Tianmin Shu, Andreas Bulling


Abstract
While humans are inherently social creatures, the challenge of identifying when and how to assist and collaborate with others - particularly when pursuing independent goals - can hinder cooperation. To address this challenge, we aim to develop an AI system that provides useful feedback to promote prosocial behaviour - actions that benefit others, even when not directly aligned with one’s own goals. We introduce ProToM, a Theory of Mind-informed facilitator that promotes prosocial actions in multi-agent systems by providing targeted, context-sensitive feedback to individual agents. ProToM first infers agents’ goals using Bayesian inverse planning, then selects feedback to communicate by maximising expected utility, conditioned on the inferred goal distribution. We evaluate our approach against baselines in two multi-agent environments: Doors, Keys, and Gems, as well as Overcooked. Our results suggest that state-of-the-art large language and reasoning models fall short of communicating feedback that is both contextually grounded and well-timed - leading to higher communication overhead and lower success rates. In contrast, ProToM provides targeted and helpful feedback, achieving a higher success rate, shorter task completion times, and is consistently preferred by human users.
Anthology ID:
2026.findings-acl.1331
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
26715–26734
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1331/
DOI:
Bibkey:
Cite (ACL):
Matteo Bortoletto, Yichao Zhou, Lance Ying, Tianmin Shu, and Andreas Bulling. 2026. ProToM: Promoting Prosocial Behaviour via Theory of Mind-Informed Feedback. In Findings of the Association for Computational Linguistics: ACL 2026, pages 26715–26734, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
ProToM: Promoting Prosocial Behaviour via Theory of Mind-Informed Feedback (Bortoletto et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1331.pdf
Checklist:
 2026.findings-acl.1331.checklist.pdf