What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels

Aditya Yadavalli, Tiago Pimentel, Tamar I Regev, Ethan Gotlieb Wilcox, Alex Warstadt


Abstract
Prosody—the melody of speech—conveys critical information often not captured by the words or text of a message.In this paper, we propose an information-theoretic approach to quantify how much is conveyed by prosody that is not recoverable from text alone, and, crucially, what prosody conveys.Our approach applies large speech and language models to estimate the mutual information between a particular dimension of an utterance’s meaning (e.g., its emotion) and any of its communication channels (e.g., audio or text).We then use this approach to quantify the information conveyed by audio and text about sarcasm, emotion, and questionhood, using speech from television and podcasts.We find that for sarcasm and emotion, the audio channel, and by implication the prosodic channel, transmits over an order of magnitude more information about these features than the text channel alone, at least when long-term context beyond the current sentence is unavailable.For questionhood, prosody provides comparatively less additional information.We conclude by outlining a program applying our approach to more dimensions of meaning, communication channels, and languages.
Anthology ID:
2026.acl-long.1085
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
23665–23679
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1085/
DOI:
Bibkey:
Cite (ACL):
Aditya Yadavalli, Tiago Pimentel, Tamar I Regev, Ethan Gotlieb Wilcox, and Alex Warstadt. 2026. What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 23665–23679, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels (Yadavalli et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1085.pdf
Checklist:
 2026.acl-long.1085.checklist.pdf