GoSt-ParC-Sign: Gold Standard Parallel Corpus of Sign and spoken language

Mirella De Sisto, Vincent Vandeghinste, Lien Soetemans, Caro Brosens, Dimitar Shterionov


Abstract
Good quality training data for Sign Language Machine Translation (SLMT) is extremely scarce, and this is one of the challenges that any project focusing on Machine Translation (MT) which also targets sign languages is currently facing. The goal of this ongoing project is to create a parallel corpus of authentic Flemish Sign Language (VGT) and written Dutch which can be employed as gold standard in automated sign language translation. The availability of a gold standard corpus like Gost-ParC-Sign can facilitate the advances of SLMT; consequently, it supports and promotes inclusiveness in MT and, on a more general level, in language technology
Anthology ID:
2023.eamt-1.54
Volume:
Proceedings of the 24th Annual Conference of the European Association for Machine Translation
Month:
June
Year:
2023
Address:
Tampere, Finland
Editors:
Mary Nurminen, Judith Brenner, Maarit Koponen, Sirkku Latomaa, Mikhail Mikhailov, Frederike Schierl, Tharindu Ranasinghe, Eva Vanmassenhove, Sergi Alvarez Vidal, Nora Aranberri, Mara Nunziatini, Carla Parra Escartín, Mikel Forcada, Maja Popovic, Carolina Scarton, Helena Moniz
Venue:
EAMT
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
503–504
Language:
URL:
https://aclanthology.org/2023.eamt-1.54
DOI:
Bibkey:
Cite (ACL):
Mirella De Sisto, Vincent Vandeghinste, Lien Soetemans, Caro Brosens, and Dimitar Shterionov. 2023. GoSt-ParC-Sign: Gold Standard Parallel Corpus of Sign and spoken language. In Proceedings of the 24th Annual Conference of the European Association for Machine Translation, pages 503–504, Tampere, Finland. European Association for Machine Translation.
Cite (Informal):
GoSt-ParC-Sign: Gold Standard Parallel Corpus of Sign and spoken language (Sisto et al., EAMT 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-2024-clasp/2023.eamt-1.54.pdf