Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion

Giuseppe Ruggiero, Matteo Testa, Jurgen Van De Walle, Luigi Di Caro


Abstract
The creation of artificial polyglot voices remains a challenging task, despite considerable progress in recent years. This paper investigates self-supervised learning for voice conversion to create native-sounding polyglot voices. We introduce a novel cross-lingual any-to-one voice conversion system that is able to preserve the source accent without the need for multilingual data from the target speaker. In addition, we show a novel cross-lingual fine-tuning strategy that further improves the accent and reduces the training data requirements. Objective and subjective evaluations with English, Spanish, French and Mandarin Chinese confirm that our approach improves on state-of-the-art methods, enhancing the speech intelligibility and overall quality of the converted speech, especially in cross-lingual scenarios. Audio samples are available at: https://giuseppe-ruggiero.github.io/a2o-vc-demo/
Anthology ID:
2024.findings-emnlp.122
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2237–2246
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2024.findings-emnlp.122/
DOI:
10.18653/v1/2024.findings-emnlp.122
Bibkey:
Cite (ACL):
Giuseppe Ruggiero, Matteo Testa, Jurgen Van De Walle, and Luigi Di Caro. 2024. Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 2237–2246, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion (Ruggiero et al., Findings 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2024.findings-emnlp.122.pdf
Data:
 2024.findings-emnlp.122.data.zip