Shichao Fang
2025
Fine-tuning LLMs to Extract Epilepsy Seizure Frequency Data from Health Records
Ben Holgate
|
Joe Davies
|
Shichao Fang
|
Joel Winston
|
James Teo
|
Mark Richardson
ACL 2025
We developed a new methodology of extracting the frequency of a patient’s epilepsy seizures from unstructured, free-text outpatient clinic letters by: first, devising a singular unit of measurement for seizure frequency; and second, fine-tuning a generative Large Language Model (LLM) on our bespoke annotated dataset. We measured frequency by the number of seizures per month: one seizure or more requires an integer; and less than one a decimal. This approach enables us to track whether a patient”s seizures are improving or not over time. We found fine-tuning improves the F1 score of our best-performing LLM, Ministral-8B-Instruct-2410, by around three times compared to an untrained model. We also found Ministral demonstrated an impressive ability for mathematical reasoning.
2024
Extracting Epilepsy Patient Data with Llama 2
Ben Holgate
|
Shichao Fang
|
Anthony Shek
|
Matthew McWilliam
|
Pedro Viana
|
Joel S. Winston
|
James T. Teo
|
Mark P. Richardson
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing
We fill a gap in scholarship by applying a generative Large Language Model (LLM) to extract information from clinical free text about the frequency of seizures experienced by people with epilepsy. Seizure frequency is difficult to determine across time from unstructured doctors’ and nurses’ reports of outpatients’ visits that are stored in Electronic Health Records (EHRs) in the United Kingdom’s National Health Service (NHS). We employ Meta’s Llama 2 to mine the EHRs of people with epilepsy and determine, where possible, a person’s seizure frequency at a given point in time. The results demonstrate that the new, powerful generative LLMs may improve outcomes for clinical NLP research in epilepsy and other areas.
Search
Fix author
Co-authors
- Ben Holgate 2
- Joe Davies 1
- Matthew McWilliam 1
- Mark P. Richardson 1
- Mark Richardson 1
- show all...