Datasets and Methods for Improving the Cultural Capabilities of NLP Systems: A Survey

Tania Chakraborty, Eylon Caplan, Zhaoqing Wu, Kevin Cushing, Bruce Qin, Shreya Havaldar, Dan Goldwasser


Abstract
In recent years, there has been a surge of interest in Cultural NLP, with substantial efforts to create globally inclusive NLP systems. The rapid growth of literature in this field makes it difficult to track trends in methods and data resources. To address this, we survey over 375 papers to answer three complementary questions: (1) What Cultural Capabilities (CCs) are being targeted in NLP systems? (2) How are cultural data resources being created? and (3) What methods are being used to improve the CCs of those systems? We discuss trends observed across the three questions, and identify relevant research gaps. To facilitate further research in this field, we release our full list of surveyed papers, in the form of an interactive web interface, CultureMine, which includes a feature to allow researchers to add their work; we hope this facilitates future research and proves to be a valuable resource for the Cultural NLP community.
Anthology ID:
2026.nlpcss-1.14
Volume:
Proceedings of the Seventh Workshop on Natural Language Processing and Computational Social Science
Month:
July
Year:
2026
Address:
San Diego
Editors:
Dallas Card, Anjalie Field, Katherine Keith, Julia Mendelsohn
Venues:
NLP+CSS | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
198–248
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.nlpcss-1.14/
DOI:
Bibkey:
Cite (ACL):
Tania Chakraborty, Eylon Caplan, Zhaoqing Wu, Kevin Cushing, Bruce Qin, Shreya Havaldar, and Dan Goldwasser. 2026. Datasets and Methods for Improving the Cultural Capabilities of NLP Systems: A Survey. In Proceedings of the Seventh Workshop on Natural Language Processing and Computational Social Science, pages 198–248, San Diego. Association for Computational Linguistics.
Cite (Informal):
Datasets and Methods for Improving the Cultural Capabilities of NLP Systems: A Survey (Chakraborty et al., NLP+CSS 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.nlpcss-1.14.pdf