Exploring Partial Knowledge Base Inference in Biomedical Entity Linking

Hongyi Yuan, Keming Lu, Zheng Yuan


Abstract
Biomedical entity linking (EL) consists of named entity recognition (NER) and named entity disambiguation (NED). EL models are trained on corpora labeled by a predefined KB. However, it is a common scenario that only entities within a subset of the KB are precious to stakeholders. We name this scenario partial knowledge base inference; training an EL model with one KB and inferring on the part of it without further training. In this work, we give a detailed definition and evaluation procedures for this practically valuable but significantly understudied scenario and evaluate methods from three representative EL paradigms. We construct partial KB inference benchmarks and witness a catastrophic degradation in EL performance due to dramatically precision drop.Our findings reveal these EL paradigms can not correctly handle unlinkable mentions (NIL), so they are not robust to partial KB inference. We also propose two simple-and-effective redemption methods to combat the NIL issue with little computational overhead.
Anthology ID:
2023.bionlp-1.3
Volume:
The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks
Month:
July
Year:
2023
Address:
Toronto, Canada
Venue:
BioNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
37–49
Language:
URL:
https://aclanthology.org/2023.bionlp-1.3
DOI:
10.18653/v1/2023.bionlp-1.3
Bibkey:
Cite (ACL):
Hongyi Yuan, Keming Lu, and Zheng Yuan. 2023. Exploring Partial Knowledge Base Inference in Biomedical Entity Linking. In The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, pages 37–49, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Exploring Partial Knowledge Base Inference in Biomedical Entity Linking (Yuan et al., BioNLP 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/remove-xml-comments/2023.bionlp-1.3.pdf