Abstract
Medical Report Generation (MRG) is a sub-task of Natural Language Generation (NLG) and aims to present information from various sources in textual form and synthesize salient information, with the goal of reducing the time spent by domain experts in writing medical reports and providing support information for decision-making. Given the specificity of the medical domain, the evaluation of automatically generated medical reports is of paramount importance to the validity of these systems. Therefore, in this paper, we focus on the evaluation of automatically generated medical reports from the perspective of automatic and human evaluation. We present evaluation methods for general NLG evaluation and how they have been applied to domain-specific medical tasks. The study shows that MRG evaluation methods are very diverse, and that further work is needed to build shared evaluation methods. The state of the art also emphasizes that such an evaluation must be task specific and include human assessments, requesting the participation of experts in the field.- Anthology ID:
- 2023.clinicalnlp-1.48
- Volume:
- Proceedings of the 5th Clinical Natural Language Processing Workshop
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Tristan Naumann, Asma Ben Abacha, Steven Bethard, Kirk Roberts, Anna Rumshisky
- Venue:
- ClinicalNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 447–459
- Language:
- URL:
- https://aclanthology.org/2023.clinicalnlp-1.48
- DOI:
- 10.18653/v1/2023.clinicalnlp-1.48
- Cite (ACL):
- Yongxin Zhou, Fabien Ringeval, and François Portet. 2023. A Survey of Evaluation Methods of Generated Medical Textual Reports. In Proceedings of the 5th Clinical Natural Language Processing Workshop, pages 447–459, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- A Survey of Evaluation Methods of Generated Medical Textual Reports (Zhou et al., ClinicalNLP 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2023.clinicalnlp-1.48.pdf