AdaBioBERT: Adaptive Token Sequence Learning for Biomedical Named Entity Recognition

Sumit Kumar, Tanmay Basu


Abstract
Accurate identification and labeling of biomedical entities, such as diseases, genes, chemical and species, within scientific texts are crucial for understanding complex relationships. We propose Adaptive BERT or AdaBioBERT, a robust named entity recognition (NER) model that builds upon BioBERT (Biomedical Bidirectional Encoded Representation from Transformers) based on an adaptive loss function to learn different types of biomedical token sequence. This adaptive loss function combines the standard Cross Entropy (CE) loss and Conditional Random Field (CRF) loss to optimize both token level accuracy and sequence-level coherence. AdaBioBERT captures rich semantic nuances by leveraging pre-trained contextual embeddings from BioBERT. On the other hand, the CRF loss of AdaBioBERT ensures proper identification of complex multi-token biomedical entities in a sequence and the CE loss can capture the simple unigram entities in a sequence. The empirical analysis on multiple standard biomedical coprora demonstrates that AdaBioBERT performs better than the state of the arts for most of the datasets in terms of macro and micro averaged F1 score.’
Anthology ID:
2025.bionlp-1.6
Volume:
ACL 2025
Month:
August
Year:
2025
Address:
Viena, Austria
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
56–62
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.6/
DOI:
Bibkey:
Cite (ACL):
Sumit Kumar and Tanmay Basu. 2025. AdaBioBERT: Adaptive Token Sequence Learning for Biomedical Named Entity Recognition. In ACL 2025, pages 56–62, Viena, Austria. Association for Computational Linguistics.
Cite (Informal):
AdaBioBERT: Adaptive Token Sequence Learning for Biomedical Named Entity Recognition (Kumar & Basu, BioNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.6.pdf
Supplementarymaterial:
 2025.bionlp-1.6.SupplementaryMaterial.txt
Supplementarymaterial:
 2025.bionlp-1.6.SupplementaryMaterial.zip