Quantum-Infused Whisper: A Framework for Replacing Classical Components

Tapabrata Mondal, Debjit Dhar, Soham Lahiri, Sivaji Bandyopadhyay


Abstract
We propose a compact hybrid quantum–classical extension of OpenAI’s Whisper in which classical components are replaced by Quantum Convolutional Neural Networks (QCNN), Quantum LSTMs (QLSTM), and optional Quantum Adaptive Self-Attention (QASA). Log-mel spectrograms are angle encoded and processed by QCNN kernels, whose outputs feed a Transformer encoder, while QLSTM-based decoding introduces quantum-enhanced temporal modeling. The design incorporates pretrained acoustic embeddings and is constrained to NISQ-feasible circuit depths and qubit counts. Although this work is primarily architectural, we provide a fully specified, reproducible evaluation plan using Speech Commands, LibriSpeech, and Common Voice, along with strong classical baselines and measurable hypotheses for assessing noise robustness, efficiency, and parameter sparsity. To our knowledge, this is the first hardware-aware, module-wise quantum replacement framework for Whisper.
Anthology ID:
2025.quantumnlp-1.1
Volume:
Proceedings of the QuantumNLP{:} Integrating Quantum Computing with Natural Language Processing
Month:
November
Year:
2025
Address:
Mumbai, India (Hybrid)
Editors:
Santanu Pal, Partha Pakray, Priyanka Jain, Asif Ekbal, Sivaji Bandyopadhyay
Venues:
QuantumNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–5
Language:
URL:
https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.quantumnlp-1.1/
DOI:
Bibkey:
Cite (ACL):
Tapabrata Mondal, Debjit Dhar, Soham Lahiri, and Sivaji Bandyopadhyay. 2025. Quantum-Infused Whisper: A Framework for Replacing Classical Components. In Proceedings of the QuantumNLP{:} Integrating Quantum Computing with Natural Language Processing, pages 1–5, Mumbai, India (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Quantum-Infused Whisper: A Framework for Replacing Classical Components (Mondal et al., QuantumNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.quantumnlp-1.1.pdf