基于提示探针的大模型知识掌握能力评测

Wang Chunyu; Bo Chen (陈波); Yang Xu; Xiaobing Zhao

基于提示探针的大模型知识掌握能力评测

Wang Chunyu, Bo Chen, Yang Xu, Xiaobing Zhao

Abstract

"大语言模型在知识密集型任务中的表现高度依赖其内化知识的覆盖面和掌握程度。然而,当前缺乏系统化、细粒度的评测方法以刻画模型对不同类别知识的掌握能力。为此,本文提出一种基于提示探针的方法,系统评估大语言模型在常识性知识、事实性知识和专业领域知识方面的掌握情况。首先构建了一个高质量的知识探针评测数据集KPE-Pro(Knowledge Probing Evaluation for Proficiency)。然后设计提示模板对多个主流大语言模型进行系统评测。评测结果表明,大语言模型在常识性知识方面表现较好,ERNIE X1模型取得整体最好成绩;在事实性知识上,大语言模型的表现较弱,轻量模型的知识掌握能力明显不足。评测数据公开于:https://github.com/cyuu313/KPE-Pro。"

Anthology ID:: 2025.ccl-1.24
Volume:: Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025)
Month:: August
Year:: 2025
Address:: Jinan, China
Editors:: Maosong Sun, Peiyong Duan, Zhiyuan Liu, Ruifeng Xu, Weiwei Sun
Venue:: CCL
SIG:
Publisher:: Chinese Information Processing Society of China
Note:
Pages:: 311–322
Language:
URL:: https://preview.aclanthology.org/ingest-ccl/2025.ccl-1.24/
DOI:
Bibkey:
Cite (ACL):: Wang Chunyu, Bo Chen, Yang Xu, and Xiaobing Zhao. 2025. 基于提示探针的大模型知识掌握能力评测. In Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025), pages 311–322, Jinan, China. Chinese Information Processing Society of China.
Cite (Informal):: 基于提示探针的大模型知识掌握能力评测 (Chunyu et al., CCL 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-ccl/2025.ccl-1.24.pdf

PDF Cite Search Fix data