Automatic Named Entity Obfuscation in Speech

Judita Preiss


Abstract
Sharing data containing personal information often requires its anonymization, even when consent for sharing was obtained from the data originator. While approaches exist for automated anonymization of text, the area is not as thoroughly explored in speech. This work focuses on identifying named entities in speech and replacing them with substitutes synthesized using voice cloning, which are inserted into the original audio, thereby retaining prosodic information while reducing the likelihood of deanonymization. The approach employs a novel named entity recognition (NER) system built directly on speech by training HuBERT (Hsu et al., 2021) using the English speech NER dataset (Yadav et al., 2020). Name substitutes are found using a masked language model and are synthesized using text-to-speech voice cloning (Eren and team, 2021), after which the substitute named entities are re-inserted into the original audio. The approach is prototyped on a sample of the LibriSpeech corpus (Panayotov et al., 2015), with each step evaluated individually.
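The final step described in the abstract, inserting a synthesized substitute into the original audio, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names `seconds_to_samples` and `splice_entity`, the linear fade, and the 16 kHz sample rate in the usage note are all assumptions made for illustration, and the sketch assumes the entity's start and end times are already known from the NER step.

```python
from typing import List, Sequence


def seconds_to_samples(t: float, rate: int) -> int:
    """Convert a time in seconds to a sample index (hypothetical helper)."""
    return int(round(t * rate))


def splice_entity(
    audio: Sequence[float],
    start: int,
    end: int,
    replacement: Sequence[float],
    fade: int = 0,
) -> List[float]:
    """Return a copy of `audio` with audio[start:end] swapped for `replacement`.

    `fade` is the number of samples over which the inserted clip is ramped
    in and out linearly, an assumed way of softening clicks at the joins.
    """
    if not 0 <= start <= end <= len(audio):
        raise ValueError("span must lie inside the audio")
    new = list(replacement)
    # Never fade more than half of the clip from each side.
    n = min(fade, len(new) // 2)
    for i in range(n):
        gain = i / n
        new[i] *= gain
        new[len(new) - 1 - i] *= gain
    # The replacement may be longer or shorter than the original span,
    # so the output length can differ from the input length.
    return list(audio[:start]) + new + list(audio[end:])
```

For example, with an assumed sample rate of 16000, an entity spanning 1.0 s to 1.5 s would be replaced by calling `splice_entity(audio, seconds_to_samples(1.0, 16000), seconds_to_samples(1.5, 16000), clip, fade=160)`. Only the entity span is touched, so the surrounding original speech, and with it the speaker's prosody, is left unchanged.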
Anthology ID:
2023.findings-acl.39
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
615–622
URL:
https://aclanthology.org/2023.findings-acl.39
DOI:
10.18653/v1/2023.findings-acl.39
Cite (ACL):
Judita Preiss. 2023. Automatic Named Entity Obfuscation in Speech. In Findings of the Association for Computational Linguistics: ACL 2023, pages 615–622, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Automatic Named Entity Obfuscation in Speech (Preiss, Findings 2023)
PDF:
https://preview.aclanthology.org/ingest-acl-2023-videos/2023.findings-acl.39.pdf