SAFFRON: tranSfer leArning For Food-disease RelatiOn extractioN

Gjorgjina Cenikj, Tome Eftimov, Barbara Koroušić Seljak


Abstract
The accelerating growth of big data in the biomedical domain, with an endless amount of electronic health records and more than 30 million citations and abstracts in PubMed, introduces the need for automatic structuring of textual biomedical data. In this paper, we develop a method for detecting relations between food and disease entities from raw text. Due to the lack of annotated data on food with respect to health, we explore the feasibility of transfer learning by training BERT-based models on existing datasets annotated for the presence of cause and treat relations among different types of biomedical entities, and using them to recognize the same relations between food and disease entities in a dataset created for the purposes of this study. The best models achieve macro averaged F1 scores of 0.847 and 0.900 for the cause and treat relations, respectively.
Anthology ID:
2021.bionlp-1.4
Volume:
Proceedings of the 20th Workshop on Biomedical Language Processing
Month:
June
Year:
2021
Address:
Online
Venue:
BioNLP
SIG:
SIGBIOMED
Publisher:
Association for Computational Linguistics
Note:
Pages:
30–40
Language:
URL:
https://aclanthology.org/2021.bionlp-1.4
DOI:
10.18653/v1/2021.bionlp-1.4
Bibkey:
Cite (ACL):
Gjorgjina Cenikj, Tome Eftimov, and Barbara Koroušić Seljak. 2021. SAFFRON: tranSfer leArning For Food-disease RelatiOn extractioN. In Proceedings of the 20th Workshop on Biomedical Language Processing, pages 30–40, Online. Association for Computational Linguistics.
Cite (Informal):
SAFFRON: tranSfer leArning For Food-disease RelatiOn extractioN (Cenikj et al., BioNLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2021.bionlp-1.4.pdf