A Bayesian Optimization Approach to Machine Translation Reranking

Julius Cheng; Maike Züfle; Vilém Zouhar; Andreas Vlachos

Abstract

Reranking, that is, scoring a list of prediction candidates from a machine translation system with an external scoring model and returning the highest-scoring candidate, remains a simple and effective method for improving prediction quality. However, reranking with high-quality scoring models can add substantial computational cost to the translation pipeline. We address this cost by framing list reranking as a Bayesian optimization (BayesOpt) problem over the candidate list, where unknown scores are modeled with a Gaussian process. The algorithm scores candidates iteratively, choosing the next candidate by balancing exploration (scoring candidates that differ from those already scored) against exploitation (scoring candidates that resemble high-scoring ones). This procedure finds high-scoring candidates while scoring only a fraction of the candidate list: given candidate lists of 200 random samples (before deduplication), our method reaches, with only 70 scoring evaluations on average, the same CometKiwi score as scoring a random subset of 180 candidates. We also propose multi-fidelity BayesOpt for list reranking, where scores obtained from a noisier but cheaper proxy scoring model are incorporated into the search process. We show that well-trained distilled proxy scorers can further improve the performance of BayesOpt.
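The iterative procedure described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: candidates are represented by hypothetical feature vectors `feats`, similarity is an RBF kernel, the expensive scorer is an arbitrary callable `score_fn`, and the acquisition function is an upper confidence bound (posterior mean plus `beta` times posterior standard deviation), which is one common way to trade off exploitation and exploration. The abstract does not state which kernel or acquisition function the authors use.

```python
import math
import random

def rbf(a, b, length_scale=0.2):
    """RBF similarity between two feature vectors (assumed kernel)."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2 * length_scale ** 2))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x

def gp_posterior(feats, scored, ys, x, noise=1e-4):
    """Gaussian process posterior mean and std of the unknown score at x."""
    K = [[rbf(feats[i], feats[j]) + (noise if i == j else 0.0)
          for j in scored] for i in scored]
    k = [rbf(feats[i], x) for i in scored]
    mean_y = sum(ys) / len(ys)  # constant prior mean set to the observed mean
    alpha = solve(K, [y - mean_y for y in ys])
    v = solve(K, k)
    mu = mean_y + sum(ki * ai for ki, ai in zip(k, alpha))
    var = max(1.0 - sum(ki * vi for ki, vi in zip(k, v)), 0.0)
    return mu, math.sqrt(var)

def bayesopt_rerank(feats, score_fn, budget, n_init=2, beta=1.0, seed=0):
    """Score at most `budget` candidates; return (best index, number scored)."""
    rng = random.Random(seed)
    n = len(feats)
    scored = rng.sample(range(n), min(n_init, n))  # random initial candidates
    ys = [score_fn(i) for i in scored]
    while len(scored) < min(budget, n):
        best_i, best_acq = None, -math.inf
        for i in range(n):
            if i in scored:
                continue
            mu, sd = gp_posterior(feats, scored, ys, feats[i])
            acq = mu + beta * sd  # high mean: exploit; high std: explore
            if acq > best_acq:
                best_i, best_acq = i, acq
        scored.append(best_i)
        ys.append(score_fn(best_i))
    top = max(range(len(scored)), key=lambda j: ys[j])
    return scored[top], len(scored)

if __name__ == "__main__":
    # Toy data: 21 one-dimensional "candidates" with a smooth stand-in score.
    toy_feats = [[i / 20] for i in range(21)]
    toy_score = lambda i: -(toy_feats[i][0] - 0.7) ** 2
    print(bayesopt_rerank(toy_feats, toy_score, budget=8))
```

In this sketch the expensive scorer is called exactly `budget` times (or once per candidate if the list is shorter), and the returned candidate is the best among those actually scored. A real system would replace the toy features with sentence representations and `score_fn` with a quality estimation model; the multi-fidelity variant mentioned in the abstract is not covered here.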

Anthology ID: 2025.naacl-long.145
Volume: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month: April
Year: 2025
Address: Albuquerque, New Mexico
Editors: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue: NAACL
Publisher: Association for Computational Linguistics
Pages: 2849–2862
URL: https://preview.aclanthology.org/landing_page/2025.naacl-long.145/
Cite (ACL): Julius Cheng, Maike Züfle, Vilém Zouhar, and Andreas Vlachos. 2025. A Bayesian Optimization Approach to Machine Translation Reranking. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 2849–2862, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal): A Bayesian Optimization Approach to Machine Translation Reranking (Cheng et al., NAACL 2025)
PDF: https://preview.aclanthology.org/landing_page/2025.naacl-long.145.pdf
