Abstract
There are millions of articles in PubMed database. To facilitate information retrieval, curators in the National Library of Medicine (NLM) assign a set of Medical Subject Headings (MeSH) to each article. MeSH is a hierarchically-organized vocabulary, containing about 28K different concepts, covering the fields from clinical medicine to information sciences. Several automatic MeSH indexing models have been developed to improve the time-consuming and financially expensive manual annotation, including the NLM official tool – Medical Text Indexer, and the winner of BioASQ Task5a challenge – DeepMeSH. However, these models are complex and not interpretable. We propose a novel end-to-end model, AttentionMeSH, which utilizes deep learning and attention mechanism to index MeSH terms to biomedical text. The attention mechanism enables the model to associate textual evidence with annotations, thus providing interpretability at the word level. The model also uses a novel masking mechanism to enhance accuracy and speed. In the final week of BioASQ Chanllenge Task6a, we ranked 2nd by average MiF using an on-construction model. After the contest, we achieve close to state-of-the-art MiF performance of ∼ 0.684 using our final model. Human evaluations show AttentionMeSH also provides high level of interpretability, retrieving about 90% of all expert-labeled relevant words given an MeSH-article pair at 20 output.- Anthology ID:
- W18-5306
- Volume:
- Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering
- Month:
- November
- Year:
- 2018
- Address:
- Brussels, Belgium
- Editors:
- Ioannis A. Kakadiaris, George Paliouras, Anastasia Krithara
- Venue:
- BioASQ
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 47–56
- Language:
- URL:
- https://aclanthology.org/W18-5306
- DOI:
- 10.18653/v1/W18-5306
- Cite (ACL):
- Qiao Jin, Bhuwan Dhingra, William Cohen, and Xinghua Lu. 2018. AttentionMeSH: Simple, Effective and Interpretable Automatic MeSH Indexer. In Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering, pages 47–56, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- AttentionMeSH: Simple, Effective and Interpretable Automatic MeSH Indexer (Jin et al., BioASQ 2018)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/W18-5306.pdf
- Data
- BioASQ