@inproceedings{inan-etal-2025-signalignlm,
title = "{S}ign{A}lign{LM}: Integrating Multimodal Sign Language Processing into Large Language Models",
author = "Inan, Mert and
Sicilia, Anthony and
Alikhani, Malihe",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/transition-to-people-yaml/2025.findings-acl.190/",
doi = "10.18653/v1/2025.findings-acl.190",
pages = "3691--3706",
ISBN = "979-8-89176-256-5",
abstract = "Deaf and Hard-of-Hearing (DHH) users increasingly utilize Large Language Models (LLMs), yet face significant challenges due to these models' limited understanding of sign language grammar, multimodal sign inputs, and Deaf cultural contexts. Further, current approaches that try to address these limitations, frequently reduce sign language processing (SLP) to traditional translation tasks, neglecting the multimodal and linguistic complexity inherent in signed languages. In this paper, we present an empirical investigation informed by learning theory into natively integrating sign language support within LLMs, directly addressing the documented needs of DHH users. We introduce the first text-based and multimodal LLMs capable of sign language processing called SignAlignLM, and propose new prompting and fine-tuning strategies incorporating sign linguistic rules and conventions. We show that LLMs can be generalized interfaces for both spoken and signed languages if trained with a multitasking paradigm. Our code and model checkpoints are open-source."
}
Markdown (Informal)
[SignAlignLM: Integrating Multimodal Sign Language Processing into Large Language Models](https://aclanthology.org/2025.findings-acl.190/) (Inan et al., Findings 2025)