Thesis Proposal: Detecting Empathy Using Multimodal Language Model
Md Rakibul Hasan, Md Zakir Hossain, Aneesh Krishna, Shafin Rahman, Tom Gedeon
Abstract
Empathy is crucial in numerous social interactions, including human-robot, patient-doctor, teacher-student, and customer-call centre conversations. Despite its importance, empathy detection in videos continues to be a challenging task because of the subjective nature of empathy and often remains under-explored. Existing studies have relied on scripted or semi-scripted interactions in text-, audio-, or video-only settings that fail to capture the complexities and nuances of real-life interactions. This PhD research aims to fill these gaps by developing a multimodal language model (MMLM) that detects empathy in audiovisual data. In addition to leveraging existing datasets, the proposed study involves collecting real-life interaction video and audio. This study will leverage optimisation techniques like neural architecture search to deliver an optimised small-scale MMLM. Successful implementation of this project has significant implications in enhancing the quality of social interactions as it enables real-time measurement of empathy and thus provides potential avenues for training for better empathy in interactions.- Anthology ID:
- 2024.eacl-srw.27
- Volume:
- Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop
- Month:
- March
- Year:
- 2024
- Address:
- St. Julian’s, Malta
- Editors:
- Neele Falk, Sara Papi, Mike Zhang
- Venue:
- EACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 338–349
- Language:
- URL:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.eacl-srw.27/
- DOI:
- Cite (ACL):
- Md Rakibul Hasan, Md Zakir Hossain, Aneesh Krishna, Shafin Rahman, and Tom Gedeon. 2024. Thesis Proposal: Detecting Empathy Using Multimodal Language Model. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 338–349, St. Julian’s, Malta. Association for Computational Linguistics.
- Cite (Informal):
- Thesis Proposal: Detecting Empathy Using Multimodal Language Model (Hasan et al., EACL 2024)
- PDF:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.eacl-srw.27.pdf