Moinul Haque
2025
From Facts to Folklore: Evaluating Large Language Models on Bengali Cultural Knowledge
Nafis Chowdhury
|
Moinul Haque
|
Anika Ahmed
|
Nazia Tasnim
|
Md. Istiak Hossain Shihab
|
Sajjadur Rahman
|
Farig Sadeque
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Recent progress in NLP research has demonstrated remarkable capabilities of large language models (LLMs) across a wide range of tasks. While recent multilingual benchmarks have advanced cultural evaluation for LLMs, critical gaps remain in capturing the nuances of low-resource cultures. Our work addresses these limitations through a Bengali Language Cultural Knowledge (BLanCK) dataset including folk traditions, culinary arts, and regional dialects. Our investigation of several multilingual language models shows that while these models perform well in non-cultural categories, they struggle significantly with cultural knowledge and performance improves substantially across all models when context is provided, emphasizing context-aware architectures and culturally curated training data.
Search
Fix author
Co-authors
- Anika Ahmed 1
- Nafis Chowdhury 1
- Sajjadur Rahman 1
- Farig Sadeque 1
- Md. Istiak Hossain Shihab 1
- show all...