LAILab at ArchEHR-QA 2025: Test-time scaling for evidence selection in grounded question answering from electronic health records
Tuan Dung Le, Thanh Duong, Shohreh Haddadan, Behzad Jazayeri, Brandon Manley, Thanh Thieu
Abstract
This paper presents our approach to the ArchEHR shared task on generating answers to real-world patient questions grounded in evidence from electronic health records (EHRs). We investigate the zero-shot capabilities of general-purpose, domain-agnostic large language models (LLMs) in two key aspects: identifying essential supporting evidence and producing concise, coherent answers. To this aim, we propose a two-stage pipeline: (1) evidence identification via test-time scaling (TTS) and (2) generating the final answer conditioned on selected evidences from the previous stage.Our approach leverages high-temperature sampling to generate multiple outputs during the evidence selection phase. This TTS-based approach effectively explore more potential evidences which results in significant improvement of the factuality score of the answers.- Anthology ID:
- 2025.bionlp-share.9
- Volume:
- BioNLP 2025 Shared Tasks
- Month:
- August
- Year:
- 2025
- Address:
- Vienna, Austria
- Editors:
- Sarvesh Soni, Dina Demner-Fushman
- Venues:
- BioNLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 75–80
- Language:
- URL:
- https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-share.9/
- DOI:
- Cite (ACL):
- Tuan Dung Le, Thanh Duong, Shohreh Haddadan, Behzad Jazayeri, Brandon Manley, and Thanh Thieu. 2025. LAILab at ArchEHR-QA 2025: Test-time scaling for evidence selection in grounded question answering from electronic health records. In BioNLP 2025 Shared Tasks, pages 75–80, Vienna, Austria. Association for Computational Linguistics.
- Cite (Informal):
- LAILab at ArchEHR-QA 2025: Test-time scaling for evidence selection in grounded question answering from electronic health records (Le et al., BioNLP 2025)
- PDF:
- https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-share.9.pdf