Interpretable ICD Code Classification with Faithful Sentence Extraction

Yichen Wang, Lian Hong, Masato Mizogaki, Shunnosuke Umeda, Toshimune Kenmotsu, Akihiro Tamura, Daniel Andrade


Abstract
Transformer-based models such as PLM-CA achieve strong performance for automatic ICD coding, but their attention weights do not provide faithful explanations of their predictions. This is a major limitation for electronic medical records, where users often need concise and trustworthy evidence for each assigned code. To address this issue, we jointly train a sentence extractor and an ICD code classifier such that predictions are based only on the extracted sentences. As a result, the extracted sentences serve as faithful rationales for each predicted code and substantially reduce the effort required to inspect long medical records. Experiments on MIMIC-III show that our method approaches the performance of a transformer baseline that processes the full record while using only a small fraction of the document.
Anthology ID:
2026.bionlp-1.54
Volume:
BioNLP 2026
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Kirk Roberts, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
679–686
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.54/
DOI:
Bibkey:
Cite (ACL):
Yichen Wang, Lian Hong, Masato Mizogaki, Shunnosuke Umeda, Toshimune Kenmotsu, Akihiro Tamura, and Daniel Andrade. 2026. Interpretable ICD Code Classification with Faithful Sentence Extraction. In BioNLP 2026, pages 679–686, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
Interpretable ICD Code Classification with Faithful Sentence Extraction (Wang et al., BioNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.54.pdf