Francisco Pereira


2025

pdf bib
5cNLP at BioLaySumm2025: Prompts, Retrieval, and Multimodal Fusion
Juan Antonio Lossio-Ventura | Callum Chan | Arshitha Basavaraj | Hugo Alatrista-Salas | Francisco Pereira | Diana Inkpen
BioNLP 2025 Shared Tasks

In this work, we present our approach to addressing all subtasks of the BioLaySumm 2025 shared task by leveraging prompting and retrieval strategies, as well as multimodal input fusion. Our method integrates: (1) zero-shot and few-shot prompting with large language models (LLMs); (2) semantic similarity-based dynamic few-shot prompting; (3) retrieval-augmented generation (RAG) incorporating biomedical knowledge from the Unified Medical Language System (UMLS); and (4) a multimodal fusion pipeline that combines images and captions using image-text-to-text generation for enriched lay summarization. Our framework enables lightweight adaptation of pretrained LLMs for generating lay summaries from scientific articles and radiology reports. Using modern LLMs, including Llama-3.3-70B-Instruct and GPT-4.1, our 5cNLP team achieved third place in Subtask 1.2 and second place in Subtask 2.1, among all submissions.

2010

pdf bib
Learning semantic features for fMRI data from definitional text
Francisco Pereira | Matthew Botvinick | Greg Detre
Proceedings of the NAACL HLT 2010 First Workshop on Computational Neurolinguistics