Seahawk at MedGenVidQA 2026: LLM Segment-Range Selection for Medical Visual Answer Localization

Xiaotian Tian, Gulustan Dogan


Abstract
Medical visual answer localization requires identifying the temporal span in a video where a medical question is answered or visually explained. We present a simple retrieval-and-selection pipeline for Task C that treats visual answer localization as segment-level answer paragraph selection over timestamped video transcripts. Given a question and a segmented transcript, our system prompts DeepSeek to select a contiguous range of transcript segments rather than directly generating timestamps. The final start and end times are then computed deterministically from the selected segment boundaries, decreasing the risk of hallucinated or malformed temporal outputs. To support long videos, we apply overlapping sliding-window prompting and rank candidate ranges using lexical question. In a 20-sample sanity check on test dataset, a completeness-biased configuration achieved an mIoU of 0.3217, while a shorter duration-penalized configuration improved performance to 0.4815. These results suggest that constrained LLM-based segment selection, combined with deterministic timestamp extraction, is a practical baseline for medical visual answer localization.
Anthology ID:
2026.bionlp-2.34
Volume:
Proceedings of the BioNLP 2026 (Shared Tasks)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Deepak Gupta, Dina Demner-Fushman
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
257–261
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-2.34/
DOI:
Bibkey:
Cite (ACL):
Xiaotian Tian and Gulustan Dogan. 2026. Seahawk at MedGenVidQA 2026: LLM Segment-Range Selection for Medical Visual Answer Localization. In Proceedings of the BioNLP 2026 (Shared Tasks), pages 257–261, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Seahawk at MedGenVidQA 2026: LLM Segment-Range Selection for Medical Visual Answer Localization (Tian & Dogan, BioNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-2.34.pdf
Supplementarymaterial:
 2026.bionlp-2.34.SupplementaryMaterial.txt
Supplementarymaterial:
 2026.bionlp-2.34.SupplementaryMaterial.zip