Why Are We Lonely? Leveraging LLMs to Measure and Understand Loneliness in Caregivers and Non-caregivers

Michelle Damin Kim; Ellie S. Paek; Yufen Lin; Emily Mroz; Jane Chung; Jinho D. Choi

Why Are We Lonely? Leveraging LLMs to Measure and Understand Loneliness in Caregivers and Non-caregivers

Michelle Damin Kim, Ellie S. Paek, Yufen Lin, Emily Mroz, Jane Chung, Jinho D. Choi

Abstract

This paper presents an LLM-driven approach for constructing diverse social media datasets to measure and compare loneliness in the caregiver and non-caregiver populations. We introduce an expert-developed loneliness evaluation framework and an expert-informed typology for categorizing causes of loneliness for analyzing social media text. Using a human-validated data processing pipeline, we apply GPT-4o, GPT-5-nano, and GPT-5 to build a high-quality Reddit corpus and analyze loneliness across both populations. The loneliness evaluation framework achieved average accuracies of 76.09% and 79.78% for caregivers and non-caregivers, respectively. The cause categorization framework achieved micro-aggregate F1 scores of 0.825 and 0.80 for caregivers and non-caregivers, respectively. Across populations, we observe substantial differences in the distribution of types of causes of loneliness. Caregivers’ loneliness were predominantly linked to caregiving roles, identity recognition, and feelings of abandonment, indicating distinct loneliness experiences between the two groups. Demographic extraction further demonstrates the viability of Reddit for building a diverse caregiver loneliness dataset. Overall, this work establishes an LLM-based pipeline for creating high quality social media datasets for studying loneliness and demonstrates its effectiveness in analyzing population-level differences in the manifestation of loneliness.

Anthology ID:: 2026.healing-1.19
Volume:: Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Danilova, Murathan Kurfalı, Ylva Söderfeldt, Julia Reed, Andrew Burchell
Venues:: HeaLing | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 222–235
Language:
URL:: https://preview.aclanthology.org/ingest-eacl/2026.healing-1.19/
DOI:
Bibkey:
Cite (ACL):: Michelle Damin Kim, Ellie S. Paek, Yufen Lin, Emily Mroz, Jane Chung, and Jinho D. Choi. 2026. Why Are We Lonely? Leveraging LLMs to Measure and Understand Loneliness in Caregivers and Non-caregivers. In Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026), pages 222–235, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Why Are We Lonely? Leveraging LLMs to Measure and Understand Loneliness in Caregivers and Non-caregivers (Kim et al., HeaLing 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-eacl/2026.healing-1.19.pdf

PDF Cite Search Fix data