@inproceedings{zhou-etal-2025-dual,
    title = "Dual-Path Dynamic Fusion with Learnable Query for Multimodal Sentiment Analysis",
    author = "Zhou, Miao  and
      Yang, Lina  and
      Wu, Thomas  and
      Yang, Dongnan  and
      Zhang, Xinru",
    editor = "Christodoulopoulos, Christos  and
      Chakraborty, Tanmoy  and
      Rose, Carolyn  and
      Peng, Violet",
    booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.571/",
    pages = "11366--11376",
    ISBN = "979-8-89176-332-6",
    abstract = "Multimodal Sentiment Analysis (MSA) is the task of understanding human emotions by analyzing a combination of different data sources, such as text, audio, and visual inputs. Although recent advances have improved emotion modeling across modalities, existing methods still struggle with two fundamental challenges: balancing global and fine-grained sentiment contributions, and over-reliance on the text modality. To address these issues, we propose DPDF-LQ (Dual-Path Dynamic Fusion with Learnable Query), an architecture that processes inputs through two complementary paths: global and local. The global path is responsible for establishing cross-modal dependencies, while the local path captures fine-grained representations. Additionally, we introduce the key module Dynamic Global Learnable Query Attention (DGLQA) in the global path, which dynamically allocates weights to each modality to capture their relevant features and learn global representations. Extensive experiments on the CMU-MOSI and CMU-MOSEI benchmarks demonstrate that DPDF-LQ achieves state-of-the-art performance, particularly in fine-grained sentiment prediction by effectively combining global and local features. Our code will be released at \url{https://github.com/ZhouMiaoGX/DPDF-LQ}."
}Markdown (Informal)
[Dual-Path Dynamic Fusion with Learnable Query for Multimodal Sentiment Analysis](https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.571/) (Zhou et al., EMNLP 2025)
ACL