End-to-End System for Bacteria Habitat Extraction
Farrokh Mehryary, Kai Hakala, Suwisa Kaewphan, Jari Björne, Tapio Salakoski, Filip Ginter
Abstract
We introduce an end-to-end system capable of named-entity detection, normalization and relation extraction for extracting information about bacteria and their habitats from biomedical literature. Our system is based on deep learning, CRF classifiers and vector space models. We train and evaluate the system on the BioNLP 2016 Shared Task Bacteria Biotope data. The official evaluation shows that the joint performance of our entity detection and relation extraction models outperforms the winning team of the Shared Task by 19pp on F1-score, establishing a new top score for the task. We also achieve state-of-the-art results in the normalization task. Our system is open source and freely available at https://github.com/TurkuNLP/BHE.- Anthology ID:
- W17-2310
- Volume:
- BioNLP 2017
- Month:
- August
- Year:
- 2017
- Address:
- Vancouver, Canada,
- Editors:
- Kevin Bretonnel Cohen, Dina Demner-Fushman, Sophia Ananiadou, Junichi Tsujii
- Venue:
- BioNLP
- SIG:
- SIGBIOMED
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 80–90
- Language:
- URL:
- https://aclanthology.org/W17-2310
- DOI:
- 10.18653/v1/W17-2310
- Cite (ACL):
- Farrokh Mehryary, Kai Hakala, Suwisa Kaewphan, Jari Björne, Tapio Salakoski, and Filip Ginter. 2017. End-to-End System for Bacteria Habitat Extraction. In BioNLP 2017, pages 80–90, Vancouver, Canada,. Association for Computational Linguistics.
- Cite (Informal):
- End-to-End System for Bacteria Habitat Extraction (Mehryary et al., BioNLP 2017)
- PDF:
- https://preview.aclanthology.org/landing_page/W17-2310.pdf
- Code
- TurkuNLP/BHE