Bridging Text-to-Sign Translation via Codebook-Oriented Pretraining

Ninlawat Phuangchoke; Chantri Polprasert

Abstract

Sign Language Production (SLP), the automatic translation from spoken to sign languages, is challenging because of the intricate mapping between linguistic semantics and the spatial–temporal motion domain. Existing SLP methods that combine a transformer model with Vector Quantization (VQ) exhibit poor translation performance due to weak semantic alignment between the codebook and the text representation. In this work, we propose a novel text-to-sign translation method based on model pretraining, which enhances semantic alignment by inheriting codebook-oriented prior knowledge from masked self-supervised models. Our approach involves two stages: (i) transforming sign language into discrete values by employing VQ with masked self-attention learning to create pre-tasks that bridge the semantic gap between text and codebook representations; and (ii) constructing an end-to-end architecture with an encoder-decoder-like structure that inherits the model parameters from the first stage. Together, these designs form a robust sign language representation and significantly improve the translation model, which surpasses prior baselines.
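
The first stage described above can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes pose frames are plain feature vectors, that quantization is a nearest-neighbor lookup in a fixed codebook, and that the masked pre-task replaces a random subset of code indices with a mask id. The function names `quantize` and `mask_tokens`, the toy codebook, and the masking ratio are all hypothetical.

```python
import random


def quantize(frames, codebook):
    """Map each pose frame to the index of its nearest codebook vector."""
    indices = []
    for frame in frames:
        # Squared Euclidean distance from this frame to every code vector.
        dists = [sum((f - c) ** 2 for f, c in zip(frame, code)) for code in codebook]
        indices.append(dists.index(min(dists)))
    return indices


def mask_tokens(indices, mask_id, ratio, rng):
    """Replace a random subset of code indices with mask_id.

    Returns the masked sequence and the sorted masked positions, which a
    model would be trained to reconstruct (assumed pre-task, for illustration).
    """
    n_mask = max(1, int(len(indices) * ratio))
    positions = set(rng.sample(range(len(indices)), n_mask))
    masked = [mask_id if i in positions else tok for i, tok in enumerate(indices)]
    return masked, sorted(positions)


# Toy data: three 2-D code vectors and four 2-D "pose frames" (hypothetical values).
codebook = [[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]]
frames = [[0.1, -0.1], [0.9, 1.2], [1.8, 0.1], [1.1, 0.9]]

tokens = quantize(frames, codebook)
masked, positions = mask_tokens(
    tokens, mask_id=len(codebook), ratio=0.5, rng=random.Random(0)
)
```

Here `tokens` is the discrete sign sequence, and `masked` is the corrupted input from which a model would learn to predict the original indices at `positions`. In a real system the codebook would be learned jointly with an encoder rather than fixed by hand.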

Anthology ID: 2026.lrec-main.746
Volume: Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month: May
Year: 2026
Address: Palma de Mallorca, Spain
Editors: Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue: LREC
Publisher: ELRA Language Resource Association
Pages: 9504–9513
URL: https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.746/
Cite (ACL): Ninlawat Phuangchoke and Chantri Polprasert. 2026. Bridging Text-to-Sign Translation via Codebook-Oriented Pretraining. International Conference on Language Resources and Evaluation, main:9504–9513.
Cite (Informal): Bridging Text-to-Sign Translation via Codebook-Oriented Pretraining (Phuangchoke & Polprasert, LREC 2026)
PDF: https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.746.pdf
