Diversity in Unity, Theory in Practice: Hierarchical Multitask Benchmarks for Chinese Minority Languages
Yijie Li, Xi Cao, Yuan Sun, Quulgan Minggad, Abdulla Ablikim, Jia Qing Cai Wang
Abstract
Despite the rapid advancement of LLMs, their performance on linguistically and culturally diverse minority languages within a unified national context remains underexplored. We present CMiLBench, a collection of hierarchical multitask benchmarks designed to translate theoretical notions of “diversity in unity” into practical evaluation for three representative Chinese minority languages: Tibetan, Mongolian, and Uyghur. CMiLBench comprises 24,663 instances across 5 difficulty levels and 17 tasks spanning foundational ability, cultural specificity, and safety alignment. We adopt existing dataset adaptation, minority knowledge construction, and high-resource benchmark translation to construct CMiLBench. We assess 14 state-of-the-art commercial and open-source LLMs with a hybrid framework that integrates automatic metrics and LLM-as-a-Judge scoring. The comparative experimental results reveal the gap between theoretical capability and practical utility. CMiLBench serves as a foundational and scalable evaluation resource to bridge the digital language divide and promote the informatization and intelligentization of low-resource Chinese minority languages.- Anthology ID:
- 2026.acl-long.1684
- Volume:
- Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 36353–36373
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.acl-long.1684/
- DOI:
- Cite (ACL):
- Yijie Li, Xi Cao, Yuan Sun, Quulgan Minggad, Abdulla Ablikim, and Jia Qing Cai Wang. 2026. Diversity in Unity, Theory in Practice: Hierarchical Multitask Benchmarks for Chinese Minority Languages. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 36353–36373, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Diversity in Unity, Theory in Practice: Hierarchical Multitask Benchmarks for Chinese Minority Languages (Li et al., ACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.acl-long.1684.pdf