System Report for CCL25-Eval Task 8: Improving ICD Coding with Large Language Models via Disease Entity Recognition

Tengxiao Lv, Juntao Li, Chao Liu, Haobin Yuan, Ling Luo, Jian Wang, Hongfei Lin


Abstract
"With the widespread adoption of Electronic Medical Records (EMRs), automated coding of theInternational Classification of Diseases (ICD) has become increasingly essential. However, the complexity of Chinese clinical texts presents significant challenges to traditional methods. To address these issues, CCL25-Eval Task 8 organized the Chinese EMRs ICD Diagnosis CodingEvaluation. This paper presents a method based on Large Language Models (LLMs), which divides the task into primary and other diagnosis coding. For the primary diagnosis, a confidence-guided semantic retrieval strategy is applied, while ensemble learning enhanced with NamedEntity Recognition (NER) is used for other diagnoses. The proposed approach achieved 83.42%accuracy on the official test set, ranking second in the evaluation."
Anthology ID:
2025.ccl-2.36
Volume:
Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025)
Month:
August
Year:
2025
Address:
Jinan, China
Editors:
Hongfei Lin, Bin Li, Hongye Tan
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
304–311
Language:
URL:
https://preview.aclanthology.org/ingest-ccl/2025.ccl-2.36/
DOI:
Bibkey:
Cite (ACL):
Tengxiao Lv, Juntao Li, Chao Liu, Haobin Yuan, Ling Luo, Jian Wang, and Hongfei Lin. 2025. System Report for CCL25-Eval Task 8: Improving ICD Coding with Large Language Models via Disease Entity Recognition. In Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025), pages 304–311, Jinan, China. Chinese Information Processing Society of China.
Cite (Informal):
System Report for CCL25-Eval Task 8: Improving ICD Coding with Large Language Models via Disease Entity Recognition (Lv et al., CCL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-ccl/2025.ccl-2.36.pdf