ALIGN: Word Association Learning for Cultural Alignment in Large Language Models

Chunhua Liu; Kabir Manandhar Shrestha; Sukai Huang

ALIGN: Word Association Learning for Cultural Alignment in Large Language Models

Chunhua Liu, Kabir Manandhar Shrestha, Sukai Huang

Abstract

Large language models (LLMs) exhibit cultural bias from over-represented viewpoints in training data, yet cultural alignment remains a challenge due to limited cultural knowledge and a lack of exploration into effective learning approaches. We introduce a cost-efficient and cognitively grounded method: fine-tuning LLMs on native speakers’ word-association norms, leveraging cognitive psychology findings that such associations capture cultural knowledge. Using word association datasets from native speakers in the US (English) and China (Mandarin), we train Llama-3.1-8B and Qwen-2.5-7B via supervised fine-tuning and preference optimization. We evaluate models’ cultural alignment through a two-tier evaluation framework that spans low-level lexical associations and high-level cultural value alignment using the World Values Survey. Results show significant improvements in lexical alignment (16–20% English, 43–165% Mandarin on Precision@5) and high-level cultural value shifts. On a subset of 50 questions where US and Chinese respondents diverge most, fine-tuned Qwen nearly doubles its response alignment with Chinese values (13 to 25). Remarkably, our trained 7–8B models match or exceed vanilla 70B baselines, demonstrating that a few million of culture-grounded associations achieve value alignment without expensive retraining. Our work highlights both the promise and the need for future research grounded in human cognition in improving cultural alignment in AI models.

Anthology ID:: 2026.acl-long.130
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2857–2879
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.130/
DOI:
Bibkey:
Cite (ACL):: Chunhua Liu, Kabir Manandhar Shrestha, and Sukai Huang. 2026. ALIGN: Word Association Learning for Cultural Alignment in Large Language Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2857–2879, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ALIGN: Word Association Learning for Cultural Alignment in Large Language Models (Liu et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.130.pdf
Checklist:: 2026.acl-long.130.checklist.pdf

PDF Cite Search Checklist Fix data