Abstract
Multi-modal data analysis presents formidable challenges, as developing effective methods to capture correlations among different modalities remains an ongoing pursuit. In this study, we address multi-modal sentiment analysis through a novel quantum perspective. We propose that quantum principles, such as superposition, entanglement, and interference, offer a more comprehensive framework for capturing not only the cross-modal interactions between text, acoustics, and visuals but also the intricate relations within each modality. To empirically evaluate our approach, we employ the CMUMOSEI dataset as our testbed and utilize Qiskit by IBM to run our experiments on a quantum computer. Our proposed Quantum-Enhanced Multi-Modal Analysis Framework (QeMMA) showcases its significant potential by surpassing the baseline by 3.52% and 10.14% in terms of accuracy and F1 score, respectively, highlighting the promise of quantum-inspired methodologies.- Anthology ID:
- 2023.icon-1.84
- Volume:
- Proceedings of the 20th International Conference on Natural Language Processing (ICON)
- Month:
- December
- Year:
- 2023
- Address:
- Goa University, Goa, India
- Editors:
- Jyoti D. Pawar, Sobha Lalitha Devi
- Venue:
- ICON
- SIG:
- SIGLEX
- Publisher:
- NLP Association of India (NLPAI)
- Note:
- Pages:
- 815–821
- Language:
- URL:
- https://aclanthology.org/2023.icon-1.84
- DOI:
- Cite (ACL):
- Arpan Phukan and Asif Ekbal. 2023. QeMMA: Quantum-Enhanced Multi-Modal Sentiment Analysis. In Proceedings of the 20th International Conference on Natural Language Processing (ICON), pages 815–821, Goa University, Goa, India. NLP Association of India (NLPAI).
- Cite (Informal):
- QeMMA: Quantum-Enhanced Multi-Modal Sentiment Analysis (Phukan & Ekbal, ICON 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2023.icon-1.84.pdf