SwissSLi: The Multi-parallel Sign Language Corpus for Switzerland
Zifan Jiang, Anne Göhring, Amit Moryossef, Rico Sennrich, Sarah Ebling
Abstract
In this work, we introduce SwissSLi, the first sign language corpus that contains parallel data of all three Swiss sign languages, namely Swiss German Sign Language (DSGS), French Sign Language of Switzerland (LSF-CH), and Italian Sign Language of Switzerland (LIS-CH). The data underlying this corpus originates from television programs in three spoken languages: German, French, and Italian. The programs have for the most part been translated into sign language by deaf translators, resulting in a unique, up to six-way multi-parallel dataset between spoken and sign languages. We describe and release the sign language videos and spoken language subtitles as well as the overall statistics and some derivatives of the raw material. These derived components include cropped videos, pose estimation, phrase/sign-segmented videos, and sentence-segmented subtitles, all of which facilitate downstream tasks such as sign language transcription (glossing) and machine translation. The corpus is publicly available on the SWISSUbase data platform for research purposes only under a CC BY-NC-SA 4.0 license.
- Anthology ID: 2024.lrec-main.1342
- Volume: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
- Month: May
- Year: 2024
- Address: Torino, Italia
- Editors: Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
- Venues: LREC | COLING
- Publisher: ELRA and ICCL
- Pages: 15448–15456
- URL: https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.lrec-main.1342/
- Cite (ACL): Zifan Jiang, Anne Göhring, Amit Moryossef, Rico Sennrich, and Sarah Ebling. 2024. SwissSLi: The Multi-parallel Sign Language Corpus for Switzerland. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 15448–15456, Torino, Italia. ELRA and ICCL.
- Cite (Informal): SwissSLi: The Multi-parallel Sign Language Corpus for Switzerland (Jiang et al., LREC-COLING 2024)
- PDF: https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.lrec-main.1342.pdf