Speech Recognition and Synthesis Technologies Applied to Preservation and Revitalization of the Ainu Language

Tatsuya Kawahara, Kohei Matsuura


Abstract
This paper gives an overview of our activities in developing automatic speech recognition (ASR) and text-to-speech (TTS) systems for the preservation and revitalization of the Ainu language, once spoken in the Hokkaido area of Japan, and listed as "severely endangered" of extinction. With a large pretrained model, a high-performing ASR system can be trained even with five hours of speech from a few speakers. It has been used to streamline the transcription and archiving of old recordings. A TTS system is also developed and used for revitalizing the speech of old folktales whose audio is missing. It is also used to provide a reference for speaking practice for new Ainu speakers. Speech technologies are important for endangered languages because their cultures have typically been passed down orally, and our efforts will be useful for passing them on to the future.
Anthology ID:
2026.computel-1.2
Volume:
Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Godfred Agyapong, Sarah Moeller, Antti Arppe, Ali Marashian, Daisy Rosenblum
Venues:
ComputEL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10–14
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.2/
DOI:
Bibkey:
Cite (ACL):
Tatsuya Kawahara and Kohei Matsuura. 2026. Speech Recognition and Synthesis Technologies Applied to Preservation and Revitalization of the Ainu Language. In Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9), pages 10–14, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Speech Recognition and Synthesis Technologies Applied to Preservation and Revitalization of the Ainu Language (Kawahara & Matsuura, ComputEL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.2.pdf
Supplementarymaterial:
 2026.computel-1.2.SupplementaryMaterial.txt
Supplementarymaterial:
 2026.computel-1.2.SupplementaryMaterial.docx