Understanding LLMs’ summarization capabilities: an analysis of biomedical abstract and lay summary generation

Batuhan Nursal; Cassie S. Mitchell

Abstract

Scientific abstracts and lay summaries serve distinct but critical roles in research communication. Abstracts use technical language for academic audiences, while lay summaries aim to make findings accessible to non-specialists. With the rise of large language models (LLMs), there is increasing interest in automating the generation of both types of summaries—especially in the biomedical domain, where clarity and factual accuracy are essential. This study evaluates the performance of lightweight LLMs (under 8B parameters) in generating biomedical abstracts and lay summaries in a zero-shot setting. We assess outputs across three key dimensions: relevance, readability, and factuality. Additionally, we introduce a novel analysis of the sectional origin and desirability of information—where desirability reflects the utility of content from the reader’s perspective. We further compare human and LLM preferences using an objective ranking task. Our results show that LLM-generated summaries often contain comparable levels of desirable information to gold-standard human references. In several cases, LLM outputs are preferred by human evaluators and occasionally mistaken for human-authored text. These findings demonstrate the potential of lightweight LLMs for scalable, high-quality summarization and suggest their practical use in domains requiring both technical and accessible communication. The codebase for this study is publicly available on GitHub: https://github.com/batuinmetz/Understanding-LLMs-summarization-capabilities

Anthology ID: 2026.findings-acl.554
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 11393–11417
URL: https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.554/
Cite (ACL): Batuhan Nursal and Cassie S. Mitchell. 2026. Understanding LLMs’ summarization capabilities: an analysis of biomedical abstract and lay summary generation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 11393–11417, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Understanding LLMs’ summarization capabilities: an analysis of biomedical abstract and lay summary generation (Nursal & Mitchell, Findings 2026)
PDF: https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.554.pdf
Checklist: 2026.findings-acl.554.checklist.pdf
