Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool
Medet Mukushev, Aigerim Kydyrbekova, Vadim Kimmelman, Anara Sandygulova
Abstract
This paper presents a new dataset for Kazakh-Russian Sign Language (KRSL) created for the purposes of Sign Language Processing. In 2020, Kazakhstan’s schools were quickly switched to online mode due to the COVID-19 pandemic. Every working day, the El-arna TV channel was broadcasting video lessons for grades from 1 to 11 with sign language translation. This opportunity allowed us to record a corpus with a large vocabulary and spontaneous SL interpretation. To this end, this corpus contains video recordings of Kazakhstan’s online school translated to Kazakh-Russian sign language by 7 interpreters. At the moment we collected and cleaned 890 hours of video material. A custom annotation tool was created to make the process of data annotation simple and easy-to-use by the Deaf community. To date, around 325 hours of videos have been annotated with glosses and 4,009 lessons out of 4,547 were transcribed with automatic speech-to-text software. The KRSL-OnlineSchool dataset will be made publicly available at https://krslproject.github.io/online-school/- Anthology ID:
- 2022.signlang-1.24
- Volume:
- Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
- Month:
- June
- Year:
- 2022
- Address:
- Marseille, France
- Venue:
- SignLang
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 154–158
- Language:
- URL:
- https://aclanthology.org/2022.signlang-1.24
- DOI:
- Cite (ACL):
- Medet Mukushev, Aigerim Kydyrbekova, Vadim Kimmelman, and Anara Sandygulova. 2022. Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool. In Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, pages 154–158, Marseille, France. European Language Resources Association.
- Cite (Informal):
- Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool (Mukushev et al., SignLang 2022)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2022.signlang-1.24.pdf
- Data
- How2Sign