Abstract
A large percentage of medical information is in unstructured text format in electronic medical record systems. Manual extraction of information from clinical notes is extremely time consuming. Natural language processing has been widely used in recent years for automatic information extraction from medical texts. However, algorithms trained on data from a single healthcare provider are not generalizable and error-prone due to the heterogeneity and uniqueness of medical documents. We develop a two-stage federated natural language processing method that enables utilization of clinical notes from different hospitals or clinics without moving the data, and demonstrate its performance using obesity and comorbities phenotyping as medical task. This approach not only improves the quality of a specific clinical task but also facilitates knowledge progression in the whole healthcare system, which is an essential part of learning health system. To the best of our knowledge, this is the first application of federated machine learning in clinical NLP.- Anthology ID:
- W19-5030
- Original:
- W19-5030v1
- Version 2:
- W19-5030v2
- Volume:
- Proceedings of the 18th BioNLP Workshop and Shared Task
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Dina Demner-Fushman, Kevin Bretonnel Cohen, Sophia Ananiadou, Junichi Tsujii
- Venue:
- BioNLP
- SIG:
- SIGBIOMED
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 283–291
- Language:
- URL:
- https://aclanthology.org/W19-5030
- DOI:
- 10.18653/v1/W19-5030
- Cite (ACL):
- Dianbo Liu, Dmitriy Dligach, and Timothy Miller. 2019. Two-stage Federated Phenotyping and Patient Representation Learning. In Proceedings of the 18th BioNLP Workshop and Shared Task, pages 283–291, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Two-stage Federated Phenotyping and Patient Representation Learning (Liu et al., BioNLP 2019)
- PDF:
- https://preview.aclanthology.org/ingest-acl-2023-videos/W19-5030.pdf
- Data
- MIMIC-III