BioRAG: A Systematic Ablation Study of Retrieval Strategies for Biomedical Question Answering

Krushil Bhojani, Mayank Waghmare, Hima Bindu Nandyala


Abstract
Retrieval strategy selection is a critical but understudied design decision in biomedical RAG systems. Existing evaluations rely on lexical metrics that miss answer grounding, or require proprietary infrastructure that limits reproducibility. We present BioRAG, a head-to-head ablation of seven retrieval strategies on BioASQ-13b, evaluated using four RAGAs metrics with a locally deployed judge at zero monetary cost. Hybrid BM25 plus dense retrieval with Reciprocal Rank Fusion achieves faithfulness of 0.534 and context recall of 0.507, improvements of 50% and 85% over naive dense retrieval, confirmed across three random seed re-samples. HyDE improves faithfulness by 14% but reduces context precision by 52%, a tradeoff not previously documented on BioASQ. No single strategy dominates all four metrics, indicating that strategy selection must be application-driven. Sensitivity analysis across k in {3,5,10} confirms ranking stability. A domain mismatch diagnostic confirms 2% corpus coverage failure. The full pipeline runs on consumer hardware without paid APIs, directly addressing BioNLP 2026’s emphasis on reproducibility and evaluation frameworks for health-related applications.
Anthology ID:
2026.bionlp-1.10
Volume:
BioNLP 2026
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Kirk Roberts, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
104–114
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.10/
DOI:
Bibkey:
Cite (ACL):
Krushil Bhojani, Mayank Waghmare, and Hima Bindu Nandyala. 2026. BioRAG: A Systematic Ablation Study of Retrieval Strategies for Biomedical Question Answering. In BioNLP 2026, pages 104–114, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
BioRAG: A Systematic Ablation Study of Retrieval Strategies for Biomedical Question Answering (Bhojani et al., BioNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.10.pdf