Abstract
Medical literature is a vast and constantly expanding source of information about diseases, their diagnoses and treatments. One of the ways to extract insights from this type of data is through mining association rules between such entities. However, existing solutions do not take into account the semantics of sentences from which entity co-occurrences are extracted. We propose a scalable solution for the automated discovery of semantic associations between different entities such as diseases and their symptoms. Our approach employs the UMLS semantic network and a binary relation classification model trained with distant supervision to validate and help ranking the most likely entity associations pairs extracted with frequency-based association rule mining algorithms. We evaluate the proposed system on the task of extracting disease-symptom associations from a collection of over 14M PubMed abstracts and validate our results against a publicly available known list of disease-symptom pairs.- Anthology ID:
- 2022.emnlp-industry.6
- Volume:
- Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track
- Month:
- December
- Year:
- 2022
- Address:
- Abu Dhabi, UAE
- Editors:
- Yunyao Li, Angeliki Lazaridou
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 77–89
- Language:
- URL:
- https://aclanthology.org/2022.emnlp-industry.6
- DOI:
- 10.18653/v1/2022.emnlp-industry.6
- Cite (ACL):
- Alberto Purpura, Francesca Bonin, and Joao Bettencourt-silva. 2022. Accelerating the Discovery of Semantic Associations from Medical Literature: Mining Relations Between Diseases and Symptoms. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 77–89, Abu Dhabi, UAE. Association for Computational Linguistics.
- Cite (Informal):
- Accelerating the Discovery of Semantic Associations from Medical Literature: Mining Relations Between Diseases and Symptoms (Purpura et al., EMNLP 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2022.emnlp-industry.6.pdf