UniBuc-SB at ArchEHR-QA 2025: A Resource-Constrained Pipeline for Relevance Classification and Grounded Answer Synthesis

Sebastian Balmus, Dura Bogdan, Ana Sabina Uban


Abstract
We describe the UniBuc-SB submission to the ArchEHR-QA shared task, which involved generating grounded answers to patient questions based on electronic health records. Our system exceeded the performance of the provided baseline, achieving higher performance in generating contextually relevant responses. Notably, we developed our approach under constrained computational resources, utilizing only a single NVIDIA RTX 4090 GPU. We refrained from incorporating any external datasets, relying solely on the limited training data supplied by the organizers. To address the challenges posed by the low-resource setting, we leveraged off-the-shelf pre-trained language models and fine-tuned them minimally, aiming to maximize performance while minimizing overfitting.
Anthology ID:
2025.bionlp-share.7
Volume:
Proceedings of the 24th Workshop on Biomedical Language Processing (Shared Tasks)
Month:
August
Year:
2025
Address:
Vienna, Austria
Editors:
Sarvesh Soni, Dina Demner-Fushman
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
62–68
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.bionlp-share.7/
DOI:
10.18653/v1/2025.bionlp-share.7
Bibkey:
Cite (ACL):
Sebastian Balmus, Dura Bogdan, and Ana Sabina Uban. 2025. UniBuc-SB at ArchEHR-QA 2025: A Resource-Constrained Pipeline for Relevance Classification and Grounded Answer Synthesis. In Proceedings of the 24th Workshop on Biomedical Language Processing (Shared Tasks), pages 62–68, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
UniBuc-SB at ArchEHR-QA 2025: A Resource-Constrained Pipeline for Relevance Classification and Grounded Answer Synthesis (Balmus et al., BioNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.bionlp-share.7.pdf