Cross-lingual Matryoshka Representation Learning across Speech and Text

Yaya SY, Dioula Doucouré, Christophe Cerisara, Irina Illina


Abstract
Speakers of under-represented languages face both a language barrier, as most online knowledge is in a few dominant languages, and a modality barrier, since information is largely text-based while many languages are primarily oral. We address this for French-Wolof by training the first bilingual speech-text Matryoshka embedding model, enabling efficient retrieval of French text from Wolof speech queries without relying on a costly ASR-translation pipelines. We introduce large-scale data curation pipelines and new benchmarks, compare modeling strategies, and show that modality fusion within a frozen text Matryoshka model performs best. Although trained only for retrieval, the model generalizes well to other tasks, such as speech intent detection, indicating the learning of general semantic representations. Finally, we analyze cost-accuracy trade-offs across Matryoshka dimensions and ranks, showing that information is concentrated only in a few components, suggesting potential for efficiency improvements.
Anthology ID:
2026.findings-acl.1710
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
34218–34228
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.1710/
DOI:
Bibkey:
Cite (ACL):
Yaya SY, Dioula Doucouré, Christophe Cerisara, and Irina Illina. 2026. Cross-lingual Matryoshka Representation Learning across Speech and Text. In Findings of the Association for Computational Linguistics: ACL 2026, pages 34218–34228, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Cross-lingual Matryoshka Representation Learning across Speech and Text (SY et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.1710.pdf
Checklist:
 2026.findings-acl.1710.checklist.pdf