Beyond Sentence-level Labels: Integrating Conversational Context and Personal Experience for Natural Emotional Expression

Haiyang Sun, Chenyang Le, Wei Wang, Leying Zhang, Chuang Li, Bing Han, Chenda Li, Mengxiao Bi, Yanmin Qian


Abstract
Emotional Text-to-Speech aims to synthesize speech with human-like naturalness and expressiveness. However, existing systems rely on sentence-level labels, which fails to capture the subtle nuances of human affect. Based on cognitive appraisal theories, we argue that emotional expression is not generated in isolation but is deeply influenced by speaker’s Personal Experience and the conversational Context.To overcome the information bottleneck inherent in traditional annotations, we present Emotional-Context-Speech, a large-scale, context-aware speech corpus derived from multi-speaker audiobooks. This dataset provides not only transcriptions but also dialogue context, personal experience, open-vocabulary emotion labels, and paralinguistic descriptions.Experimental results demonstrate that TTS model trained using additional context and experience descriptions as inputs, called Emotional-Context-TTS, significantly outperforms existing methods in terms of emotional expression accuracy and naturalness.
Anthology ID:
2026.findings-acl.940
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
18839–18854
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.940/
DOI:
Bibkey:
Cite (ACL):
Haiyang Sun, Chenyang Le, Wei Wang, Leying Zhang, Chuang Li, Bing Han, Chenda Li, Mengxiao Bi, and Yanmin Qian. 2026. Beyond Sentence-level Labels: Integrating Conversational Context and Personal Experience for Natural Emotional Expression. In Findings of the Association for Computational Linguistics: ACL 2026, pages 18839–18854, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Beyond Sentence-level Labels: Integrating Conversational Context and Personal Experience for Natural Emotional Expression (Sun et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.940.pdf
Checklist:
 2026.findings-acl.940.checklist.pdf