Non-autoregressive Modeling for Sign-gloss to Texts Translation

Fan Zhou; Tim Van de Cruys

Non-autoregressive Modeling for Sign-gloss to Texts Translation

Abstract

Automatic sign language translation has seen significant advancements, driven by progress in computer vision and natural language processing. While end to end sign-to-text translation systems are available, many systems still rely on a gloss-based representation–an intermediate symbolic representation that functions as a bridge between sign language and its written counterpart. This paper focuses on the gloss-to-text (gloss2text) task, a key step in the sign-to-text translation pipeline, which has traditionally been addressed using autoregressive (AR) modeling approaches. In this study, we propose the use of non-autoregressive (NAR) modeling techniques, including non-autoregressive Transformer (NAT) and diffusion models, tailored to the unique characteristics of gloss2text. Specifically, we introduce PointerLevT, a novel NAT-based model designed to enhance performance in this task. Our experiments demonstrate that NAR models achieve higher accuracy than pre-trained AR models with less data, while also matching the performance of fine-tuned AR models such as mBART. Furthermore, we evaluate inference speed and find that NAR models benefit from parallel generation, resulting in faster inference. However, they require more time to achieve an optimal balance between accuracy and speed, particularly in the multistep denoising process of diffusion models.

Anthology ID:: 2025.mtsummit-1.17
Volume:: Proceedings of Machine Translation Summit XX: Volume 1
Month:: June
Year:: 2025
Address:: Geneva, Switzerland
Editors:: Pierrette Bouillon, Johanna Gerlach, Sabrina Girletti, Lise Volkart, Raphael Rubino, Rico Sennrich, Ana C. Farinha, Marco Gaido, Joke Daems, Dorothy Kenny, Helena Moniz, Sara Szoc
Venue:: MTSummit
SIG:
Publisher:: European Association for Machine Translation
Note:
Pages:: 220–230
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.mtsummit-1.17/
DOI:
Bibkey:
Cite (ACL):: Fan Zhou and Tim Van de Cruys. 2025. Non-autoregressive Modeling for Sign-gloss to Texts Translation. In Proceedings of Machine Translation Summit XX: Volume 1, pages 220–230, Geneva, Switzerland. European Association for Machine Translation.
Cite (Informal):: Non-autoregressive Modeling for Sign-gloss to Texts Translation (Zhou & de Cruys, MTSummit 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.mtsummit-1.17.pdf

PDF Cite Search Fix data