DLNLP at ClinicalSkillQA: EvidenceFlow for Structured Zero-Shot Clinical Keyframe Ordering

Kexin Li, Zhekun Wang, Yiran Wang, Di Zhao


Abstract
The ClinSkill QA shared task requires models to recover the temporal order of scrambled clinical keyframes and generate explanations. We propose EvidenceFlow, a structured zero-shot framework based on Qwen2.5-VL that decomposes the task into global overview, local evidence modeling, and ordering decision, with two variants: model-led EvidenceFlow-M and rule-guided EvidenceFlow-R. On the official test set, EvidenceFlow-R achieves better ordering performance, while EvidenceFlow-M produces better explanation quality, revealing a trade-off between ordering stability and rationale generation. EvidenceFlow provides an interpretable zero-shot baseline for clinical keyframe ordering.
Anthology ID:
2026.bionlp-2.5
Volume:
Proceedings of the BioNLP 2026 (Shared Tasks)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Deepak Gupta, Dina Demner-Fushman
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
33–37
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-2.5/
DOI:
Bibkey:
Cite (ACL):
Kexin Li, Zhekun Wang, Yiran Wang, and Di Zhao. 2026. DLNLP at ClinicalSkillQA: EvidenceFlow for Structured Zero-Shot Clinical Keyframe Ordering. In Proceedings of the BioNLP 2026 (Shared Tasks), pages 33–37, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
DLNLP at ClinicalSkillQA: EvidenceFlow for Structured Zero-Shot Clinical Keyframe Ordering (Li et al., BioNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-2.5.pdf
Supplementarymaterial:
 2026.bionlp-2.5.SupplementaryMaterial.zip