Overview of the ClinIQLink 2025 Shared Task on Medical Question-Answering

Brandon Colelough; Davis Bartels; Dina Demner-Fushman

Abstract

In this paper, we present an overview of ClinIQLink, a shared task co-located with the 24th BioNLP workshop at ACL 2025 and designed to stress-test large language models (LLMs) on medically oriented question answering aimed at the level of a General Practitioner. The challenge supplies 4,978 expert-verified, medical source-grounded question–answer pairs that cover seven formats: true/false, multiple choice, unordered list, short answer, short-inverse, multi-hop, and multi-hop-inverse. Participating systems, bundled in Docker or Apptainer images, are executed on the CodaBench platform or the University of Maryland’s Zaratan cluster. An automated harness (Task 1) scores closed-ended items by exact match and open-ended items with a three-tier embedding metric. A subsequent physician panel (Task 2) audits the top model responses.

Anthology ID: 2025.bionlp-1.32
Volume: ACL 2025
Month: August
Year: 2025
Address: Vienna, Austria
Editors: Dina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Junichi Tsujii
Venues: BioNLP | WS
Publisher: Association for Computational Linguistics
Pages: 378–387
URL: https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.32/
Cite (ACL): Brandon Colelough, Davis Bartels, and Dina Demner-Fushman. 2025. Overview of the ClinIQLink 2025 Shared Task on Medical Question-Answering. In ACL 2025, pages 378–387, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Overview of the ClinIQLink 2025 Shared Task on Medical Question-Answering (Colelough et al., BioNLP 2025)
PDF: https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.32.pdf
Supplementary material: 2025.bionlp-1.32.SupplementaryMaterial.txt
