SOMVOICE: A First Dataset to Study the Effects of Sleep Deprivation on Voice Characteristics of Healthy French Speakers

Vincent P. Martin, Jean-Luc Rouas, Colleen Beaumard, Pierre Philip


Abstract
Excessive sleepiness is a significant public health issue and a critical personal health indicator associated with various disorders. Given its high prevalence in the general population, clinicians need tools to regularly measure patients’ sleepiness levels in natural settings, such as automatic speech analysis. In this article, we introduce the SOMVOICE corpus, the first French corpus containing read-speech recordings from the same participants either after a normal night or after a night of total sleep deprivation. Participants were included according to strict inclusion and exclusion criteria based on both medical characteristics and reading proficiency. The recordings were labelled with both objective and subjective measures of sleepiness, as well as fatigue and anxiety. After introducing the data-collection methodology, we use linear mixed models to conduct a preliminary investigation of the effect of total sleep deprivation on the collected sleepiness-related measures and on participants’ reading behaviour. Doing so, we found that sleep deprivation strongly influences objective and subjective sleepiness measurements as well as fatigue self-reports, but has a lesser effect on anxiety. Regarding reading behaviour, sleep deprivation is associated with a lower speech rate (duration of the recordings and phoneme rate) and more pauses (number of pauses and pause ratio)
Anthology ID:
2026.lrec-main.438
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
5597–5606
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.438/
DOI:
Bibkey:
Cite (ACL):
Vincent P. Martin, Jean-Luc Rouas, Colleen Beaumard, and Pierre Philip. 2026. SOMVOICE: A First Dataset to Study the Effects of Sleep Deprivation on Voice Characteristics of Healthy French Speakers. International Conference on Language Resources and Evaluation, main:5597–5606.
Cite (Informal):
SOMVOICE: A First Dataset to Study the Effects of Sleep Deprivation on Voice Characteristics of Healthy French Speakers (Martin et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.438.pdf