Dhruv Jain
2025
Towards Understanding LLM-Generated Biomedical Lay Summaries
Rohan Charudatt Salvi
|
Swapnil Panigrahi
|
Dhruv Jain
|
Shweta Yadav
|
Md. Shad Akhtar
Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health)
In this paper, we investigate using large language models to generate accessible lay summaries of medical abstracts, targeting non-expert audiences. We assess the ability of models like GPT-4 and LLaMA 3-8B-Instruct to simplify complex medical information, focusing on layness, comprehensiveness, and factual accuracy. Utilizing both automated and human evaluations, we discover that automatic metrics do not always align with human judgments. Our analysis highlights the potential benefits of developing clear guidelines for consistent evaluations conducted by non-expert reviewers. It also points to areas for improvement in the evaluation process and the creation of lay summaries for future research.