SocioBench: Modeling Human Behavior in Sociological Surveys with Large Language Models

Jia Wang; Ziyu Zhao; Tingjuntao Ni; Zhongyu Wei (魏忠钰)

SocioBench: Modeling Human Behavior in Sociological Surveys with Large Language Models

Jia Wang, Ziyu Zhao, Tingjuntao Ni, Zhongyu Wei

Abstract

Large language models (LLMs) show strong potential for simulating human social behaviors and interactions, yet lack large-scale, systematically constructed benchmarks for evaluating their alignment with real-world social attitudes. To bridge this gap, we introduce SocioBench—a comprehensive benchmark derived from the annually collected, standardized survey data of the International Social Survey Programme (ISSP). The benchmark aggregates over 480,000 real respondent records from more than 30 countries, spanning 10 sociological domains and over 40 demographic attributes. Our experiments indicate that LLMs achieve only 30–40% accuracy when simulating individuals in complex survey scenarios, with statistically significant differences across domains and demographic subgroups. These findings highlight several limitations of current LLMs in survey scenarios, including insufficient individual-level data coverage, inadequate scenario diversity, and missing group-level modeling. We have open-sourced SocioBench at https://github.com/JiaWANG-TJ/SocioBench.

Anthology ID:: 2025.emnlp-main.1335
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 26268–26300
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1335/
DOI:
Bibkey:
Cite (ACL):: Jia Wang, Ziyu Zhao, Tingjuntao Ni, and Zhongyu Wei. 2025. SocioBench: Modeling Human Behavior in Sociological Surveys with Large Language Models. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 26268–26300, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: SocioBench: Modeling Human Behavior in Sociological Surveys with Large Language Models (Wang et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1335.pdf
Checklist:: 2025.emnlp-main.1335.checklist.pdf

PDF Cite Search Checklist Fix data