Qianli Xing
2026
From Coarse to Fine: A Multi-Granularity Multimodal Framework for Teacher Sentiment Analysis
Zhiyi Duan | Xiangren Wang | Jiangshan Guan | Bing Jia | Qianli Xing
Findings of the Association for Computational Linguistics: ACL 2026
Zhiyi Duan | Xiangren Wang | Jiangshan Guan | Bing Jia | Qianli Xing
Findings of the Association for Computational Linguistics: ACL 2026
Teacher sentiment analysis is pivotal for understanding instructional dynamics, yet it remains challenging because classroom expressions are professionally regulated performances rather than spontaneous outbursts. However, existing approaches typically treat sentiment as a static, monolithic label, failing to capture this structured heterogeneity. To effectively model this complexity, we decompose teacher sentiment into three granularities: coarse-level performativity, medium-level intra-class heterogeneity, and fine-level cross-modal complementarity. Guided by this perspective, we propose CF-TSA, a coarse-to-fine multimodal framework. Specifically, we employ CLS-guided cross-modal attention to recover effective signals from regulated displays (coarse-level), thresholded substyle discovery to identify latent pedagogical styles (medium-level), and substyle-aware contrastive learning to align dynamic multimodal cue compositions (fine-level). Experiments on T-MED and CMU-MOSEI demonstrate that CF-TSA consistently outperforms state-of-the-art baselines, validating the effectiveness of the coarse-to-fine perspective and the hierarchical modeling.