AwarenessBench: Assessing Cognitive Capabilities of Language Models

Xiaojian Li; Rongwu Xu; Tianyun Zhang; Yue Wang; Shuo Chen; Qiner Lyu; Briana Zhang; Peiran Yang; Kyle Xue Chen; Haoyuan Shi; Yu Wang; Wei Xu

AwarenessBench: Assessing Cognitive Capabilities of Language Models

Xiaojian Li, Rongwu Xu, Tianyun Zhang, Yue Wang, Shuo Chen, Qiner Lyu, Briana Zhang, Peiran Yang, Kyle Xue Chen, Haoyuan Shi, Yu Wang, Wei Xu

Abstract

As language models (LMs) exhibit increasingly consciousness-like behaviors, evaluating their cognitive abilities becomes essential. We introduce AwarenessBench, the first comprehensive benchmark for assessing the cognitive abilities of LMs in four dimensions: metacognition, self-awareness, social awareness, and situational awareness, covering 15 cognitive functions and 14,381 samples. Evaluating 18 state-of-the-art LMs, we find that all consistently surpass random baselines, with more advanced models performing better. We further compare LMs with human performance across three demographic groups, where the best-performing model surpasses human averages overall, but most still fall markedly short in metacognition and self-awareness. Finally, we show that awareness is a distinct capability: progress in language modeling or reasoning does not necessarily translate into improved cognition.

Anthology ID:: 2026.acl-long.124
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2682–2741
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.124/
DOI:
Bibkey:
Cite (ACL):: Xiaojian Li, Rongwu Xu, Tianyun Zhang, Yue Wang, Shuo Chen, Qiner Lyu, Briana Zhang, Peiran Yang, Kyle Xue Chen, Haoyuan Shi, Yu Wang, and Wei Xu. 2026. AwarenessBench: Assessing Cognitive Capabilities of Language Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2682–2741, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: AwarenessBench: Assessing Cognitive Capabilities of Language Models (Li et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.124.pdf
Checklist:: 2026.acl-long.124.checklist.pdf

PDF Cite Search Checklist Fix data