CCL25-Eval任务7系统报告:学而不思则罔?

郑陈锐 郑陈锐; 朱奕澄 朱奕澄; 王欣雨 王欣雨; 姜伟麟 姜伟麟; 吴会腾 吴会腾

CCL25-Eval任务7系统报告:学而不思则罔?

Abstract

"以DeepSeek-R1为代表 ,"思考"普遍被认为是一种提高大语言模型性能的方法。在CCL25-Eval"争鸣"中文阅读理解任务下,本文分别探索了"思考"和"非思考"两种模型在这项任务下的潜力。具体来说,在古代文学知识理解任务中,本文构建了古汉语特定领域的知识数据集,用大模型蒸馏了思考数据集,整理了高质量思考数据集,在这些数据基础之下同样lora微调,发现思考模型虽然性能有巨大提升,但依旧比不上原本的非思考模型。最后,开源并提交了基于Qwen2.5的SongPanda模型。"

Anthology ID:: 2025.ccl-2.32
Volume:: Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025)
Month:: August
Year:: 2025
Address:: Jinan, China
Editors:: Hongfei Lin, Bin Li, Hongye Tan
Venue:: CCL
SIG:
Publisher:: Chinese Information Processing Society of China
Note:
Pages:: 271–277
Language:
URL:: https://preview.aclanthology.org/ingest-ccl/2025.ccl-2.32/
DOI:
Bibkey:
Cite (ACL):: 郑陈锐郑陈锐, 朱奕澄朱奕澄, 王欣雨王欣雨, 姜伟麟姜伟麟, and 吴会腾吴会腾. 2025. CCL25-Eval任务7系统报告:学而不思则罔?. In Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025), pages 271–277, Jinan, China. Chinese Information Processing Society of China.
Cite (Informal):: CCL25-Eval任务7系统报告:学而不思则罔? (郑陈锐 et al., CCL 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-ccl/2025.ccl-2.32.pdf

PDF Cite Search Fix data