Predicting ICU Length of Stay for Patients using Latent Categorization of Health Conditions

Tirthankar Dasgupta, Manjira Sinha, Sudeshna Jana


Abstract
Predicting the duration of a patient’s stay in an Intensive Care Unit (ICU) is a critical challenge for healthcare administrators, as it impacts resource allocation, staffing, and patient care strategies. Traditional approaches often rely on structured clinical data, but recent developments in language models offer significant potential to utilize unstructured text data such as nursing notes, discharge summaries, and clinical reports for ICU length-of-stay (LoS) predictions. In this study, we introduce a method for analyzing nursing notes to predict the remaining ICU stay duration of patients. Our approach leverages a joint model of latent note categorization, which identifies key health-related patterns and disease severity factors from unstructured text data. This latent categorization enables the model to derive high-level insights that influence patient care planning. We evaluate our model on the widely used MIMIC-III dataset, and our preliminary findings show that it significantly outperforms existing baselines, suggesting promising industrial applications for resource optimization and operational efficiency in healthcare settings.
Anthology ID:
2025.naacl-industry.35
Volume:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track)
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Weizhu Chen, Yi Yang, Mohammad Kachuee, Xue-Yong Fu
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
422–430
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.naacl-industry.35/
DOI:
Bibkey:
Cite (ACL):
Tirthankar Dasgupta, Manjira Sinha, and Sudeshna Jana. 2025. Predicting ICU Length of Stay for Patients using Latent Categorization of Health Conditions. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track), pages 422–430, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Predicting ICU Length of Stay for Patients using Latent Categorization of Health Conditions (Dasgupta et al., NAACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.naacl-industry.35.pdf