Abstract
Shallow text analysis (Text Mining) uses mainly Information Extraction techniques. The low resource languages do not allow application of such traditional techniques with sufficient accuracy and recall on big data. In contrast, Data Mining approaches provide an opportunity to make deep analysis and to discover new knowledge. Frequent pattern mining approaches are used mainly for structured information in databases and are a quite challenging task in text mining. Unfortunately, most frequent pattern mining approaches do not use contextual information for extracted patterns: general patterns are extracted regardless of the context. We propose a method that processes raw informal texts (from health discussion forums) and formal texts (outpatient records) in Bulgarian language. In addition we use some context information and small terminological lexicons to generalize extracted frequent patterns. This allows to map informal expression of medical terminology to the formal one and to generate automatically resources.- Anthology ID:
- R17-1019
- Volume:
- Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
- Month:
- September
- Year:
- 2017
- Address:
- Varna, Bulgaria
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd.
- Note:
- Pages:
- 130–138
- Language:
- URL:
- https://doi.org/10.26615/978-954-452-049-6_019
- DOI:
- 10.26615/978-954-452-049-6_019
- Cite (ACL):
- Svetla Boytcheva, Ivelina Nikolova, and Galia Angelova. 2017. Mining Association Rules from Clinical Narratives. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 130–138, Varna, Bulgaria. INCOMA Ltd..
- Cite (Informal):
- Mining Association Rules from Clinical Narratives (Boytcheva et al., RANLP 2017)
- PDF:
- https://doi.org/10.26615/978-954-452-049-6_019