Kawai Chung
2026
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
Chunkit Chan | Yauwai Yim | Hongchuan Zeng | Zhiying Zou | Xinyuan Cheng | Zhifan Sun | Zheye Deng | Kawai Chung | Yuzhuo Ao | Fan Yixiang | Cheng Jiayang | Ercong Nie | Ginny Wong | Helmut Schmid | Hinrich Schuetze | Simon See | Yangqiu Song
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Chunkit Chan | Yauwai Yim | Hongchuan Zeng | Zhiying Zou | Xinyuan Cheng | Zhifan Sun | Zheye Deng | Kawai Chung | Yuzhuo Ao | Fan Yixiang | Cheng Jiayang | Ercong Nie | Ginny Wong | Helmut Schmid | Hinrich Schuetze | Simon See | Yangqiu Song
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Theory of Mind (ToM)—the ability to infer mental states in others—is pivotal for human social cognition. Existing evaluations of ToM in LLMs are largely limited to English, neglecting the linguistic diversity that shapes human cognition. This limitation raises a critical question: can LLMs exhibit Multilingual Theory of Mind—the capacity to reason about mental states across diverse linguistic contexts? To address this gap, we present XToM, a rigorously validated multilingual benchmark that evaluates ToM across five languages and incorporates diverse, contextually rich task scenarios. Using XToM, we systematically evaluate LLMs (e.g., DeepSeek R1), revealing a pronounced dissonance: while models excel in multilingual language understanding, their ToM performance varies across languages. Our findings expose limitations in LLMs’ ability to replicate human-like mentalizing across linguistic contexts.