大模型时代的多语言研究综述(A Survey of Multilingual Research in the Large Language Model Era)
Changjiang Gao (高长江), Hao Zhou (周昊), Shuaijie She (佘帅杰), Haoming Zhong (钟昊鸣), Sizhe Liu (刘斯哲), Zhejian Lai (赖哲剑), Zhijun Wang (王志军), Shujian Huang (黄书剑)
Abstract
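Since the start of the large language model era, the traditional paradigm of multilingual research has changed dramatically. Some traditional tasks have seen breakthrough solutions, a variety of new tasks have emerged, and many multilingual studies now build on multilingual large models and aim to improve large-model capabilities. In response to this shift in the field, this paper organizes and summarizes the progress of multilingual research since the start of the large-model era, covering multilingual large models, datasets, and tasks, as well as related frontier research directions and research challenges, in the hope of providing reference and help for the future development of multilingual research under the large-model paradigm.
- Anthology ID: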
- 2024.ccl-2.4
- Volume:
- Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 2: Frontier Forum)
- Month:
- July
- Year:
- 2024
- Address:
- Taiyuan, China
- Editor:
- Zhao Xin
- Venue:
- CCL
- Publisher:
- Chinese Information Processing Society of China
- Pages:
- 63–85
- Language:
- Chinese
- URL:
- https://preview.aclanthology.org/tal-24-ingestion/2024.ccl-2.4/
- Cite (ACL):
- Changjiang Gao, Hao Zhou, Shuaijie She, Haoming Zhong, Sizhe Liu, Zhejian Lai, Zhijun Wang, and Shujian Huang. 2024. 大模型时代的多语言研究综述(A Survey of Multilingual Research in the Large Language Model Era). In Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 2: Frontier Forum), pages 63–85, Taiyuan, China. Chinese Information Processing Society of China.
- Cite (Informal):
- 大模型时代的多语言研究综述(A Survey of Multilingual Research in the Large Language Model Era) (Gao et al., CCL 2024)
- PDF:
- https://preview.aclanthology.org/tal-24-ingestion/2024.ccl-2.4.pdf