John Torous
2026
You Never Know a Person, You Only Know Their Defenses: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations
Hongbin Na | Zimu Wang | Zhaoming Chen | Peilin Zhou | Yining Hua | Grace Ziqi Zhou | Haiyang Zhang | Tao Shen | Wei Wang | John Torous | Shaoxiong Ji | Ling Chen
Findings of the Association for Computational Linguistics: ACL 2026
Hongbin Na | Zimu Wang | Zhaoming Chen | Peilin Zhou | Yining Hua | Grace Ziqi Zhou | Haiyang Zhang | Tao Shen | Wei Wang | John Torous | Shaoxiong Ji | Ling Chen
Findings of the Association for Computational Linguistics: ACL 2026
Psychological defenses are strategies, often automatic, that people use to manage distress. Rigid use or overuse of defenses is negatively linked to mental health and shapes what speakers disclose and how they accept or resist help. However, defenses are complex and difficult to reliably measure, particularly in clinical dialogues. We introduce PsyDefConv, a dialogue corpus with help seeker utterances labeled for defense level, and DMRS Co-Pilot, a four-stage pipeline that provides evidence-based pre-annotations. The corpus contains 200 dialogues and 4,709 utterances, including 2,336 help seeker turns, with double-blind labeling reaching Cohen’s kappa of 0.639. In a counterbalanced study, the co-pilot reduced average annotation time by 24.0%. In expert review, it averaged 4.62 for evidence supportiveness, 4.44 for clinical plausibility, and 4.40 for insight on a seven-point scale. Benchmarks with strong large language models (LLMs) in zero-shot and fine-tuning settings demonstrate clear headroom, with the best macro F1-score around 30% and a tendency to overpredict mature defenses. Corpus analyses confirm that mature defenses are most common and reveal emotion-specific deviations. We release the corpus, annotations, code, and prompts to support research on defensive functioning in language.
Overview of the PsyDefDetect Shared Task at BioNLP 2026: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations
Hongbin Na | Zimu Wang | Zhaoming Chen | Yining Hua | Rena Gao | Kailai Yang | Ling Chen | Wei Wang | Shaoxiong Ji | John Torous | Sophia Ananiadou
BioNLP 2026
Hongbin Na | Zimu Wang | Zhaoming Chen | Yining Hua | Rena Gao | Kailai Yang | Ling Chen | Wei Wang | Shaoxiong Ji | John Torous | Sophia Ananiadou
BioNLP 2026
We present an overview of PsyDefDetect, the shared task on detecting levels of psychological defense mechanisms in emotional support dialogues, co-located with BioNLP@ACL 2026. Grounded in the clinically validated Defense Mechanism Rating Scales (DMRS) framework, the task asks systems to classify a target seeker utterance, given its preceding dialogue context, into one of nine categories: seven hierarchical DMRS levels plus two auxiliary labels. Participants worked on PsyDefConv, a newly released corpus of 200 dialogues and 2336 help-seeker utterances annotated under DMRS with substantial inter-annotator agreement. The task attracted 172 participants on CodaBench who produced 563 submissions, with 21 teams officially registering their results for the final ranking. The best system achieved a macro F1-score of 0.420, surpassing the strongest fine-tuned baseline reported in the dataset paper by a notable margin, yet leaving clear headroom. Our analysis highlights (i) a persistent tendency to over-predict the majority High-Adaptive class, (ii) a widening gap between accuracy and macro-F1 that reveals class-imbalance sensitivity, and (iii) the value of theory-aware and LLM-based approaches for fine-grained defensive-function classification. We release all task materials and invite the community to continue work on this novel intersection of clinical psychology and NLP.
2025
A Survey of Large Language Models in Psychotherapy: Current Landscape and Future Directions
Hongbin Na | Yining Hua | Zimu Wang | Tao Shen | Beibei Yu | Lilin Wang | Wei Wang | John Torous | Ling Chen
Findings of the Association for Computational Linguistics: ACL 2025
Hongbin Na | Yining Hua | Zimu Wang | Tao Shen | Beibei Yu | Lilin Wang | Wei Wang | John Torous | Ling Chen
Findings of the Association for Computational Linguistics: ACL 2025
Mental health is increasingly critical in contemporary healthcare, with psychotherapy demanding dynamic, context-sensitive interactions that traditional NLP methods struggle to capture. Large Language Models (LLMs) offer significant potential for addressing this gap due to their ability to handle extensive context and multi-turn reasoning. This review introduces a conceptual taxonomy dividing psychotherapy into interconnected stages–assessment, diagnosis, and treatment–to systematically examine LLM advancements and challenges. Our comprehensive analysis reveals imbalances in current research, such as a focus on common disorders, linguistic biases, fragmented methods, and limited theoretical integration. We identify critical challenges including capturing dynamic symptom fluctuations, overcoming linguistic and cultural biases, and ensuring diagnostic reliability. Highlighting future directions, we advocate for continuous multi-stage modeling, real-time adaptive systems grounded in psychological theory, and diversified research covering broader mental disorders and therapeutic approaches, aiming toward more holistic and clinically integrated psychotherapy LLMs systems.