@inproceedings{ponce-etal-2025-vicomtech,
title = "Vicomtech@{WMT} 2025: Evolutionary Model Compression for Machine Translation",
author = "Ponce, David and
Gete, Harritxu and
Etchegoyhen, Thierry",
editor = "Haddow, Barry and
Kocmi, Tom and
Koehn, Philipp and
Monz, Christof",
booktitle = "Proceedings of the Tenth Conference on Machine Translation",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.wmt-1.77/",
doi = "10.18653/v1/2025.wmt-1.77",
pages = "1011--1021",
ISBN = "979-8-89176-341-8",
abstract = "We describe Vicomtech{'}s participation in the WMT 2025 Shared Task on Model Compression. We addressed all three language pairs of the constrained task, namely Czech to German, English to Arabic and Japanese to Chinese, using the Aya Expanse 8B model as our base model. Our approach centers on GeLaCo, an evolutionary method for LLM compression via layer collapse operations, which efficiently explores the compression solution space through population-based search and a module-wise similarity fitness function that captures attention, feed-forward, and hidden state representations. We systematically evaluated compression at three different ratios (0.25, 0.50, and 0.75) and applied targeted post-training techniques to recover performance through fine-tuning and knowledge distillation over translation instructions. Additionally, we explored quantization techniques to achieve further model size reduction. Our experimental results demonstrate that the combination of evolutionary layer compression, targeted post-training, and quantization can achieve substantial model size reduction while maintaining competitive translation quality across all language pairs."
}