Response Content Units: Evaluating Completeness and Proactiveness in Medical Open-Response Question Answering
Yongsin Park, Wen-wai Yim, Emma McKibbin, Asma Ben Abacha, Fei Xia
Abstract
Remote clinical care has significantly increased the workload for healthcare professionals managing digital inquiries. While automated systems aim to alleviate this burden, consumer health questions present unique challenges due to their linguistic complexity and the need for proactive clinical guidance, which traditional question-answering models often overlook. We introduce the medical Response Content Units (RCU) schema, a framework that facilitates automatic analysis to identify question-answer completeness and critical answer subparts, which can then be used as tools for supporting clinician response or for automatic metric evaluation. Our analysis using this schema reveals a 16.4% gap in response completeness in professional replies and demonstrates that essential medical directives are provided 2.4 to 12.1 times as frequently as direct answers. We provide baseline results and publicly release our annotations and source code to offer an evaluation framework that is more closely aligned with real-world clinical requirements.- Anthology ID:
- 2026.gem-main.25
- Volume:
- Proceedings of the Fifth Workshop on Generation, Evaluation and Metrics (GEM)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Simon Mille, Sebastian Gehrmann, Patrícia Schmidtová, Ondřej Dušek, Marzieh Fadaee, Kyle Lo, Enrico Santus, Gabriel Stanovsky
- Venues:
- GEM | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 231–252
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.gem-main.25/
- DOI:
- Cite (ACL):
- Yongsin Park, Wen-wai Yim, Emma McKibbin, Asma Ben Abacha, and Fei Xia. 2026. Response Content Units: Evaluating Completeness and Proactiveness in Medical Open-Response Question Answering. In Proceedings of the Fifth Workshop on Generation, Evaluation and Metrics (GEM), pages 231–252, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- Response Content Units: Evaluating Completeness and Proactiveness in Medical Open-Response Question Answering (Park et al., GEM 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.gem-main.25.pdf