SocialStep: Fast Prediction of Social Determinants of Health

Paul Landes, Adam Richard Cross, Jimeng Sun


Abstract
Given thousands of medical documents, how can we automatically uncover patients’ social risk factors? Social Determinants of Health (SDoH) constitute a growing class of non-clinical risk factors that shape patient trajectories. While clinically significant, automatic detection of SDoH from free text remains understudied due to scarce and imbalanced training data. Current approaches often rely on monolithic large language models. We present SocialStep, a two-step hybrid pipeline that first uses a lightweight classifier to triage sentences and then applies a Large Language Model (LLM) for multilabel classification to the relevant subset. On the Medical Information Mart for Intensive Care III (MIMIC-III) dataset, SocialStep improves macro F1 by 5 points over the state-of-the-art baseline while running 12.2× faster. These findings demonstrate that integrating compact neural encoders with large language models provides a scalable and highly accurate framework for clinical NLP tasks, including SDoH extraction. Notably, we also observe some unexpected patterns in LLM performance. SocialStep offers a practical blueprint for hybrid model deployment that identifies critical social risk factors without prohibitive computational cost.
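The two-step flow described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the cue list, the label names, and the functions `triage`, `stub_llm_labeler`, and `two_step` are all hypothetical stand-ins. A real system would replace the first with a trained lightweight classifier and the second with an actual LLM call.

```python
from typing import Callable, Dict, List, Set

# Hypothetical cue list standing in for a trained lightweight classifier.
TRIAGE_CUES = ("smok", "alcohol", "homeless", "unemployed", "lives with")


def triage(sentence: str) -> bool:
    """Step 1 stand-in: cheap check for whether a sentence may mention SDoH."""
    lowered = sentence.lower()
    return any(cue in lowered for cue in TRIAGE_CUES)


def stub_llm_labeler(sentence: str) -> Set[str]:
    """Step 2 stand-in: placeholder for an LLM multilabel call.

    The label names below are illustrative assumptions, not the paper's label set.
    """
    lowered = sentence.lower()
    labels: Set[str] = set()
    if "smok" in lowered:
        labels.add("tobacco")
    if "alcohol" in lowered:
        labels.add("alcohol")
    if "homeless" in lowered or "lives with" in lowered:
        labels.add("living_status")
    if "unemployed" in lowered:
        labels.add("employment")
    return labels


def two_step(
    sentences: List[str],
    labeler: Callable[[str], Set[str]] = stub_llm_labeler,
) -> Dict[str, Set[str]]:
    """Run the expensive labeler only on sentences that pass triage."""
    results: Dict[str, Set[str]] = {}
    for sentence in sentences:
        # Sentences rejected by triage get an empty label set and cost no LLM call.
        results[sentence] = labeler(sentence) if triage(sentence) else set()
    return results


notes = [
    "Patient denies chest pain.",
    "He smokes one pack per day and is currently unemployed.",
    "Lives with his daughter.",
]
predictions = two_step(notes)
```

In this sketch only two of the three sentences reach the second step, which is the source of the speedup the hybrid design aims for: the costly model sees only the subset the cheap filter lets through.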
Anthology ID:
2026.lrec-main.846
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
Publisher:
ELRA Language Resource Association
Pages:
10802–10814
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.846/
Cite (ACL):
Paul Landes, Adam Richard Cross, and Jimeng Sun. 2026. SocialStep: Fast Prediction of Social Determinants of Health. International Conference on Language Resources and Evaluation, main:10802–10814.
Cite (Informal):
SocialStep: Fast Prediction of Social Determinants of Health (Landes et al., LREC 2026)
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.846.pdf