Mapping of Narrative Text Fields To ICD-10 Codes Using Natural Language Processing and Machine Learning

Risuna Nkolele

doi:10.18653/v1/2020.winlp-1.35

Mapping of Narrative Text Fields To ICD-10 Codes Using Natural Language Processing and Machine Learning

Abstract

The assignment of ICD-10 codes is done manually, which is laborious and prone to errors. The use of natural language processing and machine learning approaches have been receiving increasing attention on automating the task of assigning ICD-10 codes. In this study, we investigate the effect of different approaches on automating the task of assigning ICD-10 codes. To do this we use the South African clinical dataset containing three narrative text fields (Clinical Summary, Presenting Complaints, and Examination Findings). The following traditional machine learning algorithms, namely: Logistic Regression, Multinomial Naive Bayes, Support Vector Machine, Decision Tree, RandomForest, and Extreme Gradient Boost were used as our classifiers. Our study results show the strong potential of automated ICD-10 coding from the narrative text fields. ExtremeGradient Boost outperformed other classifiers in automating the task of assigning ICD-10 codes based on the three narrative text fields with an accuracy of 79%, precision of75%, and recall of 78%. While our worst classifier (Decision Tree) achieved the accuracy of 54%, precision of 60% and recall of 56%.

Anthology ID:: 2020.winlp-1.35
Volume:: Proceedings of the The Fourth Widening Natural Language Processing Workshop
Month:: July
Year:: 2020
Address:: Seattle, USA
Venues:: ACL | WS | WiNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 131–135
Language:
URL:: https://aclanthology.org/2020.winlp-1.35
DOI:: 10.18653/v1/2020.winlp-1.35
Bibkey:
Cite (ACL):: Risuna Nkolele. 2020. Mapping of Narrative Text Fields To ICD-10 Codes Using Natural Language Processing and Machine Learning. In Proceedings of the The Fourth Widening Natural Language Processing Workshop, pages 131–135, Seattle, USA. Association for Computational Linguistics.
Cite (Informal):: Mapping of Narrative Text Fields To ICD-10 Codes Using Natural Language Processing and Machine Learning (Nkolele, WiNLP 2020)
Copy Citation:
Video:: http://slideslive.com/38929575

Cite Search Video