@inproceedings{zhu-etal-2025-integrating,
    title = "Integrating Visual Modalities with Large Language Models for Mental Health Support",
    author = "Zhu, Zhouan  and
      Wang, Shangfei  and
      Wang, Yuxin  and
      Wu, Jiaqiang",
    editor = "Rambow, Owen  and
      Wanner, Leo  and
      Apidianaki, Marianna  and
      Al-Khalifa, Hend  and
      Di Eugenio, Barbara  and
      Schockaert, Steven",
    booktitle = "Proceedings of the 31st International Conference on Computational Linguistics",
    month = jan,
    year = "2025",
    address = "Abu Dhabi, UAE",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2025.coling-main.599/",
    pages = "8939--8954",
    abstract = "Current work of mental health support primarily utilizes unimodal textual data and often fails to understand and respond to users' emotional states comprehensively. In this study, we introduce a novel framework that enhances Large Language Model (LLM) performance in mental health dialogue systems by integrating multimodal inputs. Our framework uses visual language models to analyze facial expressions and body movements, then combines these visual elements with dialogue context and counseling strategies. This approach allows LLMs to generate more nuanced and supportive responses. The framework comprises four components: in-context learning via computation of semantic similarity; extraction of facial expression descriptions through visual modality data; integration of external knowledge from a knowledge base; and delivery of strategic guidance through a strategy selection module. Both automatic and human evaluations confirm that our approach outperforms existing models, delivering more empathetic, coherent, and contextually relevant mental health support responses."
}