Cassie S. Mitchell
2025
LLM as Entity Disambiguator for Biomedical Entity-Linking
Christophe Ye
|
Cassie S. Mitchell
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Entity linking involves normalizing a mention in medical text to a unique identifier in a knowledge base, such as UMLS or MeSH. Most entity linkers follow a two-stage process: first, a candidate generation step selects high-quality candidates, and then a named entity disambiguation phase determines the best candidate for final linking. This study demonstrates that leveraging a large language model (LLM) as an entity disambiguator significantly enhances entity linking models’ accuracy and recall. Specifically, the LLM disambiguator achieves remarkable improvements when applied to alias-matching entity linking methods. Without any fine-tuning, our approach establishes a new state-of-the-art (SOTA), surpassing previous methods on multiple prevalent biomedical datasets by up to 16 points in accuracy. We released our code on GitHub at https://github.com/ChristopheYe/llm_disambiguator
BioEL: A Comprehensive Python Package for Biomedical Entity Linking
Prasanth Bathala
|
Christophe Ye
|
Batuhan Nursal
|
Shubham Lohiya
|
David Kartchner
|
Cassie S. Mitchell
Findings of the Association for Computational Linguistics: NAACL 2025