Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense
Siqi Shen, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Soujanya Poria, Rada Mihalcea
Abstract
Large language models (LLMs) have demonstrated substantial commonsense understanding through numerous benchmark evaluations. However, their understanding of cultural commonsense remains largely unexamined. In this paper, we conduct a comprehensive examination of the capabilities and limitations of several state-of-the-art LLMs in the context of cultural commonsense tasks. Using several general and cultural commonsense benchmarks, we find that (1) LLMs have a significant discrepancy in performance when tested on culture-specific commonsense knowledge for different cultures; (2) LLMs' general commonsense capability is affected by cultural context; and (3) the language used to query the LLMs can impact their performance on culture-related tasks. Our study points to the inherent bias in the cultural understanding of LLMs and provides insights that can help develop culturally-aware language models.
- Anthology ID:
- 2024.naacl-long.316
- Volume:
- Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
- Month:
- June
- Year:
- 2024
- Address:
- Mexico City, Mexico
- Editors:
- Kevin Duh, Helena Gomez, Steven Bethard
- Venue:
- NAACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 5668–5680
- URL:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.naacl-long.316/
- DOI:
- 10.18653/v1/2024.naacl-long.316
- Cite (ACL):
- Siqi Shen, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Soujanya Poria, and Rada Mihalcea. 2024. Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 5668–5680, Mexico City, Mexico. Association for Computational Linguistics.
- Cite (Informal):
- Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense (Shen et al., NAACL 2024)
- PDF:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.naacl-long.316.pdf