Building Okinawan Lexicon Resource for Language Reclamation/Revitalization and Natural Language Processing Tasks such as Universal Dependencies Treebanking
So Miyagawa, Kanji Kato, Miho Zlazli, Salvatore Carlino, Seira Machida
Abstract
The Open Multilingual Online Lexicon of Okinawan (OMOLO) project aims to create an accessible, user-friendly digital lexicon for the endangered Okinawan language using digital humanities tools and methodologies. The multilingual web application, available in Japanese, English, Portuguese, and Spanish, will benefit language learners, researchers, and the Okinawan community in Japan and diaspora countries such as the U.S., Brazil, and Peru. The project also lays the foundation for an Okinawan UD Treebank, which will support computational analysis and the development of language technology tools such as parsers, machine translation systems, and speech recognition software. The OMOLO project demonstrates the potential of computational linguistics in preserving and revitalizing endangered languages and can serve as a blueprint for similar initiatives.- Anthology ID:
- 2023.resourceful-1.12
- Volume:
- Proceedings of the Second Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2023)
- Month:
- May
- Year:
- 2023
- Address:
- Tórshavn, the Faroe Islands
- Editors:
- Nikolai Ilinykh, Felix Morger, Dana Dannélls, Simon Dobnik, Beáta Megyesi, Joakim Nivre
- Venue:
- RESOURCEFUL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 86–91
- Language:
- URL:
- https://aclanthology.org/2023.resourceful-1.12
- DOI:
- Cite (ACL):
- So Miyagawa, Kanji Kato, Miho Zlazli, Salvatore Carlino, and Seira Machida. 2023. Building Okinawan Lexicon Resource for Language Reclamation/Revitalization and Natural Language Processing Tasks such as Universal Dependencies Treebanking. In Proceedings of the Second Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2023), pages 86–91, Tórshavn, the Faroe Islands. Association for Computational Linguistics.
- Cite (Informal):
- Building Okinawan Lexicon Resource for Language Reclamation/Revitalization and Natural Language Processing Tasks such as Universal Dependencies Treebanking (Miyagawa et al., RESOURCEFUL 2023)
- PDF:
- https://preview.aclanthology.org/proper-vol2-ingestion/2023.resourceful-1.12.pdf