Evaluating Document-Tuned Transformer Representations for Person-level Mental Health Assessment

Aaron Marker; Oscar Kjell; Vasudha Varadarajan; H. Andrew Schwartz

Evaluating Document-Tuned Transformer Representations for Person-level Mental Health Assessment

Aaron Marker, Oscar Kjell, Vasudha Varadarajan, H. Andrew Schwartz

Abstract

Person-level psychological assessment requires aggregating meaning across many messages from the same individual, a task that document-level training objectives were not explicitly designed for. We present a systematic, empirical comparison between architecturally matched traditional (a) base-transformers and (b) document-tuned-transformers (further contrastively fine-tuned at the document-level, sometimes referred to as "sentence transformers") under otherwise identical conditions. Comparing layer-wise and overall performance across two longitudinal mental health and psychological datasets, we find document-tuned models demonstrated a consistent improvement over base representations (increase in Pearson r of 13.4%, p=.015). Robustness analyses revealed document-tuned models remained more accurate under perturbations to word deletion, synonym replacement, typo injection, and back translation. Further, hedged language (e.g., ’usually’) was more characteristic of outcomes in document-tuned embeddings while abundance (e.g., ’lot’) was more characteristic of base-transformers, suggesting document-tuned models may better capture uncertainty.These results suggest representation choice impacts mental health prediction, document-tuned models often being more adept.

Anthology ID:: 2026.clpsych-1.14
Volume:: Proceedings of the 10th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Aya Zirikly, Kfir Bar, Sean MacAvaney, Molly Ireland, Yaakov Ophir, Dana Atzil-Slonim, Vasudha Varadarajan, Steven Bedrick, Bart Desmet
Venues:: CLPsych | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 178–187
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.clpsych-1.14/
DOI:
Bibkey:
Cite (ACL):: Aaron Marker, Oscar Kjell, Vasudha Varadarajan, and H. Andrew Schwartz. 2026. Evaluating Document-Tuned Transformer Representations for Person-level Mental Health Assessment. In Proceedings of the 10th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2026), pages 178–187, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Evaluating Document-Tuned Transformer Representations for Person-level Mental Health Assessment (Marker et al., CLPsych 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.clpsych-1.14.pdf

PDF Cite Search Fix data