The Role of Prosody in Spoken Question Answering

Jie Chi, Maureen de Seyssel, Natalie Schluter


Abstract
Spoken language understanding research to date has generally carried a heavy text perspective. Most datasets are derived from text, which is then subsequently synthesized into speech, and most models typically rely on automatic transcriptions of speech. This is to the detriment of prosody–additional information carried by the speech signal beyond the phonetics of the words themselves and difficult to recover from text alone. In this work, we investigate the role of prosody in Spoken Question Answering. By isolating prosodic and lexical information on the SLUE-SQA-5 dataset, which consists of natural speech, we demonstrate that models trained on prosodic information alone can perform reasonably well by utilizing prosodic cues. However, we find that when lexical information is available, models tend to predominantly rely on it. Our findings suggest that while prosodic cues provide valuable supplementary information, more effective integration methods are required to ensure prosody contributes more significantly alongside lexical features.
Anthology ID:
2025.findings-naacl.471
Volume:
Findings of the Association for Computational Linguistics: NAACL 2025
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8468–8479
Language:
URL:
https://preview.aclanthology.org/Ingest-2025-COMPUTEL/2025.findings-naacl.471/
DOI:
Bibkey:
Cite (ACL):
Jie Chi, Maureen de Seyssel, and Natalie Schluter. 2025. The Role of Prosody in Spoken Question Answering. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 8468–8479, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
The Role of Prosody in Spoken Question Answering (Chi et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/Ingest-2025-COMPUTEL/2025.findings-naacl.471.pdf