Section-Level Simplification of Biomedical Abstracts

Jan Bakker, Jaap Kamps


Abstract
Cochrane produces systematic reviews whose abstracts are divided into seven standard sections. However, the plain language summaries (PLS) of Cochrane reviews do not adhere to the same structure, which has prevented researchers from training simplification models on paired abstract and PLS sections. In this work, we devise a two-step method to automatically divide PLS of Cochrane reviews into the same sections in which abstracts are divided. In the first step, we align each sentence in a PLS to a section in the parallel abstract if they cover similar content. In the second step, we classify the remaining sentences into sections based on the content of the PLS and what we learned from the first step. We manually divide 22 PLS into sections to evaluate our method. Upon execution of our method, we obtain the Cochrane-sections dataset, which consists of paired abstract and PLS sections in English for a total of 7.7K Cochrane reviews. Thus, our work yields references for the section-level simplification of biomedical abstracts.
Anthology ID:
2025.emnlp-main.697
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
13830–13844
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.697/
DOI:
Bibkey:
Cite (ACL):
Jan Bakker and Jaap Kamps. 2025. Section-Level Simplification of Biomedical Abstracts. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 13830–13844, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Section-Level Simplification of Biomedical Abstracts (Bakker & Kamps, EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.697.pdf
Checklist:
 2025.emnlp-main.697.checklist.pdf