FrameNet-Cultures: A Benchmark for Evaluating LLMs via Cross-Cultural Frame Semantics

Neda Jamshidi, Anders S{\o}gaard, Monica Bianchini


Abstract
Large language models (LLMs) exhibit cultural biases, yet existing benchmarks rely on closed-form, domain-specific questionnaires. We introduce FRAMENET-CULTURES, a benchmark for evaluating cultural alignment in LLMs based on Fillmore-style frame semantics. Using the EveryCulture encyclopedia, we construct a lexicon of 18 cultural frames (e.g., greeting,child-rearing) across 20 countries, treating it as a structured reference for comparison rather than a definitive representation of contemporary societies. For each frame, we prompt five major LLMs—ChatGPT-5, Gemini-2.5-Flash, Mistral-Large, Qwen-3-Max, DeepSeek-V3.2—three times to generate open-ended instantiations, which are manually annotated and binarized. We measure alignment with country- and continent-level profiles using normalized Hamming distance, and validate cultural recognizability through human evaluation of generated dialogues. Under culture-neutral prompting, outputs align most closely with European profiles, followed by Asian and American ones, indicating a consistent cross-model pattern. With culture-specific prompting, models shift toward the target regions, aligning most strongly with Africa for Ethiopia and with Asia for India. FRAMENET-CULTURES is the first open-ended benchmark for cultural alignment relying on frame semantics. Data, prompts, and annotations are publicly available at https://github.com/neda-jamshidi/FrameNet-Cultures.
Anthology ID:
2026.findings-acl.491
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10090–10131
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.491/
DOI:
Bibkey:
Cite (ACL):
Neda Jamshidi, Anders S{\o}gaard, and Monica Bianchini. 2026. FrameNet-Cultures: A Benchmark for Evaluating LLMs via Cross-Cultural Frame Semantics. In Findings of the Association for Computational Linguistics: ACL 2026, pages 10090–10131, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
FrameNet-Cultures: A Benchmark for Evaluating LLMs via Cross-Cultural Frame Semantics (Jamshidi et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.491.pdf
Checklist:
 2026.findings-acl.491.checklist.pdf