Continuous Fingerspelling Dataset for Indian Sign Language
Kirandevraj R, Vinod K. Kurmi, Vinay P. Namboodiri, C.v. Jawahar
Abstract
Fingerspelling enables signers to represent proper nouns and technical terms letter by letter using manual alphabets, yet it remains severely under-resourced for Indian Sign Language (ISL). We present the first continuous fingerspelling dataset for ISL, extracted from the ISH News YouTube channel, in which fingerspelling is accompanied by synchronized on-screen text cues. The dataset comprises 1,308 segments from 499 videos, totaling 70.85 minutes and 14,814 characters, with aligned video-text pairs capturing authentic coarticulation patterns. We validated dataset quality by having a proficient ISL interpreter annotate 150 samples, achieving a 90.67% exact match rate. We further established baseline recognition benchmarks using a ByT5-small encoder-decoder model, which attains a Character Error Rate of 82.91% after fine-tuning. This resource supports multiple downstream tasks, including fingerspelling transcription, temporal localization, and sign generation. The dataset is available at https://kirandevraj.github.io/ISL-Fingerspelling/.
- Anthology ID:
- 2025.wslp-main.6
- Volume:
- Proceedings of the Workshop on Sign Language Processing (WSLP)
- Month:
- December
- Year:
- 2025
- Address:
- IIT Bombay, Mumbai, India (Co-located with IJCNLP–AACL 2025)
- Editors:
- Mohammed Hasanuzzaman, Facundo Manuel Quiroga, Ashutosh Modi, Sabyasachi Kamila, Keren Artiaga, Abhinav Joshi, Sanjeet Singh
- Venues:
- WSLP | WS
- Publisher:
- Association for Computational Linguistics
- Pages:
- 33–38
- URL:
- https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wslp-main.6/
- Cite (ACL):
- Kirandevraj R, Vinod K. Kurmi, Vinay P. Namboodiri, and C.v. Jawahar. 2025. Continuous Fingerspelling Dataset for Indian Sign Language. In Proceedings of the Workshop on Sign Language Processing (WSLP), pages 33–38, IIT Bombay, Mumbai, India (Co-located with IJCNLP–AACL 2025). Association for Computational Linguistics.
- Cite (Informal):
- Continuous Fingerspelling Dataset for Indian Sign Language (R et al., WSLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wslp-main.6.pdf
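
The abstract reports two metrics, exact match rate and Character Error Rate (CER). As a rough illustration of how such metrics are commonly computed, the sketch below assumes a corpus-level CER (total Levenshtein edits divided by total reference characters) and a case-sensitive exact match; the paper's actual normalization and evaluation script are not described here, so these choices are assumptions. The function names and the example words are hypothetical and are not taken from the dataset.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty hypothesis costs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Corpus-level CER in percent (assumed definition, see lead-in)."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_chars = sum(len(r) for r in refs)
    return 100.0 * total_edits / total_chars


def exact_match_rate(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Percentage of hypotheses identical to their reference."""
    matches = sum(r == h for r, h in zip(refs, hyps))
    return 100.0 * matches / len(refs)


# Hypothetical fingerspelled words and model outputs, for illustration only.
refs = ["MUMBAI", "DELHI"]
hyps = ["MUMBAI", "DELI"]
print(f"CER: {character_error_rate(refs, hyps):.2f}%")
print(f"Exact match: {exact_match_rate(refs, hyps):.2f}%")
```

Under this definition of exact match, the reported 90.67% over 150 samples is consistent with 136 matching samples, since 136 / 150 rounds to 90.67%.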