Quantum-Infused Whisper: A Framework for Replacing Classical Components
Tapabrata Mondal, Debjit Dhar, Soham Lahiri, Sivaji Bandyopadhyay
Abstract
We propose a compact hybrid quantum–classical extension of OpenAI’s Whisper in which classical components are replaced by Quantum Convolutional Neural Networks (QCNN), Quantum LSTMs (QLSTM), and optional Quantum Adaptive Self-Attention (QASA). Log-mel spectrograms are angle encoded and processed by QCNN kernels, whose outputs feed a Transformer encoder, while QLSTM-based decoding introduces quantum-enhanced temporal modeling. The design incorporates pretrained acoustic embeddings and is constrained to NISQ-feasible circuit depths and qubit counts. Although this work is primarily architectural, we provide a fully specified, reproducible evaluation plan using Speech Commands, LibriSpeech, and Common Voice, along with strong classical baselines and measurable hypotheses for assessing noise robustness, efficiency, and parameter sparsity. To our knowledge, this is the first hardware-aware, module-wise quantum replacement framework for Whisper.- Anthology ID:
- 2025.quantumnlp-1.1
- Volume:
- Proceedings of the QuantumNLP{:} Integrating Quantum Computing with Natural Language Processing
- Month:
- November
- Year:
- 2025
- Address:
- Mumbai, India (Hybrid)
- Editors:
- Santanu Pal, Partha Pakray, Priyanka Jain, Asif Ekbal, Sivaji Bandyopadhyay
- Venues:
- QuantumNLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1–5
- Language:
- URL:
- https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.quantumnlp-1.1/
- DOI:
- Cite (ACL):
- Tapabrata Mondal, Debjit Dhar, Soham Lahiri, and Sivaji Bandyopadhyay. 2025. Quantum-Infused Whisper: A Framework for Replacing Classical Components. In Proceedings of the QuantumNLP{:} Integrating Quantum Computing with Natural Language Processing, pages 1–5, Mumbai, India (Hybrid). Association for Computational Linguistics.
- Cite (Informal):
- Quantum-Infused Whisper: A Framework for Replacing Classical Components (Mondal et al., QuantumNLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.quantumnlp-1.1.pdf