KSoF: The Kassel State of Fluency Dataset – A Therapy Centered Dataset of Stuttering

Sebastian Bayerl, Alexander Wolff von Gudenberg, Florian Hönig, Elmar Noeth, Korbinian Riedhammer


Abstract
Stuttering is a complex speech disorder that negatively affects an individual’s ability to communicate effectively. Persons who stutter (PWS) often suffer considerably under the condition and seek help through therapy. Fluency shaping is a therapy approach where PWSs learn to modify their speech to help them to overcome their stutter. Mastering such speech techniques takes time and practice, even after therapy. Shortly after therapy, success is evaluated highly, but relapse rates are high. To be able to monitor speech behavior over a long time, the ability to detect stuttering events and modifications in speech could help PWSs and speech pathologists to track the level of fluency. Monitoring could create the ability to intervene early by detecting lapses in fluency. To the best of our knowledge, no public dataset is available that contains speech from people who underwent stuttering therapy that changed the style of speaking. This work introduces the Kassel State of Fluency (KSoF), a therapy-based dataset containing over 5500 clips of PWSs. The clips were labeled with six stuttering-related event types: blocks, prolongations, sound repetitions, word repetitions, interjections, and – specific to therapy – speech modifications. The audio was recorded during therapy sessions at the Institut der Kasseler Stottertherapie. The data will be made available for research purposes upon request.
Anthology ID:
2022.lrec-1.189
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1780–1787
Language:
URL:
https://aclanthology.org/2022.lrec-1.189
DOI:
Bibkey:
Cite (ACL):
Sebastian Bayerl, Alexander Wolff von Gudenberg, Florian Hönig, Elmar Noeth, and Korbinian Riedhammer. 2022. KSoF: The Kassel State of Fluency Dataset – A Therapy Centered Dataset of Stuttering. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 1780–1787, Marseille, France. European Language Resources Association.
Cite (Informal):
KSoF: The Kassel State of Fluency Dataset – A Therapy Centered Dataset of Stuttering (Bayerl et al., LREC 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.lrec-1.189.pdf
Data
KSoFLibriSpeechSEP-28k