Ivan Zhang
2025
Command-A-Translate: Raising the Bar of Machine Translation with Difficulty Filtering
Tom Kocmi | Arkady Arkhangorodsky | Alexandre Berard | Phil Blunsom | Samuel Cahyawijaya | Théo Dehaze | Marzieh Fadaee | Nicholas Frosst | Matthias Galle | Aidan Gomez | Nithya Govindarajan | Wei-Yin Ko | Julia Kreutzer | Kelly Marchisio | Ahmet Üstün | Sebastian Vincent | Ivan Zhang
Proceedings of the Tenth Conference on Machine Translation
We present Command A Translate, an LLM-based machine translation model built off Cohere’s Command A. It reaches state-of-the-art machine translation quality via direct preference optimization. Our meticulously designed data preparation pipeline emphasizes robust quality control and a novel difficulty filtering step, a key innovation that distinguishes Command A Translate. Furthermore, we extend our model and participate in WMT with a system (CommandA-WMT) that uses two models and post-editing steps of step-by-step reasoning and limited Minimum Bayes Risk decoding.