Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge

Lian Remme, Kevin Tang


Abstract
This paper provides a proof of concept that audio of tabletop role-playing games (TTRPG) could serve as a challenge for diarization systems. TTRPGs are carried out mostly by conversation. Participants often alter their voices to indicate that they are talking as a fictional character. Audio processing systems are susceptible to voice conversion with or without technological assistance. TTRPG present a conversational phenomenon in which voice conversion is an inherent characteristic for an immersive gaming experience. This could make it more challenging for diarizers to pick the real speaker and determine that impersonating is just that. We present the creation of a small TTRPG audio dataset and compare it against the AMI and the ICSI corpus. The performance of two diarizers, pyannote.audio and wespeaker, were evaluated. We observed that TTRPGs’ properties result in a higher confusion rate for both diarizers.Additionally, wespeaker strongly underestimates the number of speakers in the TTRPG audio files.We propose TTRPG audio as a promising challenge for diarization systems.
Anthology ID:
2025.findings-naacl.280
Volume:
Findings of the Association for Computational Linguistics: NAACL 2025
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4969–4983
Language:
URL:
https://preview.aclanthology.org/moar-dois/2025.findings-naacl.280/
DOI:
10.18653/v1/2025.findings-naacl.280
Bibkey:
Cite (ACL):
Lian Remme and Kevin Tang. 2025. Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 4969–4983, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge (Remme & Tang, Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/moar-dois/2025.findings-naacl.280.pdf