Miguel Moura Ramos
2026
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos | Tomás Almeida | Daniel Vareta | Filipe Azevedo | Sweta Agrawal | Patrick Fernandes | André F. T. Martins
Transactions of the Association for Computational Linguistics, Volume 14
Miguel Moura Ramos | Tomás Almeida | Daniel Vareta | Filipe Azevedo | Sweta Agrawal | Patrick Fernandes | André F. T. Martins
Transactions of the Association for Computational Linguistics, Volume 14
Reinforcement learning (RL) has been proven to be an effective and robust method for training neural machine translation systems, especially when paired with powerful reward models that accurately assess translation quality. However, most research has focused on RL methods that use sentence-level feedback, leading to inefficient learning signals due to the reward sparsity problem—the model receives a single score for the entire sentence. To address this, we propose a novel approach that leverages fine-grained, token-level quality assessments along with error severity levels using RL methods. Specifically, we use xCOMET, a state-of-the-art quality estimation system, as our token-level reward model. We conduct experiments on small and large translation datasets with standard encoder-decoder and large language models-based machine translation systems, comparing the impact of sentence-level versus fine-grained reward signals on translation quality. Our results show that training with token-level rewards improves translation quality across language pairs over baselines according to both automatic and human evaluation. Furthermore, token-level reward optimization improves training stability, evidenced by a steady increase in mean rewards over training epochs.
AMALIA: A Fully Open Large Language Model for European Portuguese
Afonso Simplício | Gonçalo Vinagre | Miguel Moura Ramos | Diogo Tavares | Rafael Ferreira | Giuseppe Attanasio | Duarte M. Alves | Inês Calvo | Inês Vieira | Rui Guerra | James Furtado | Beatriz Canaverde | Iago Paulo | Vasco Ramos | Diogo Glória-Silva | Miguel Faria | Marcos Treviso | Daniel Gomes | Pedro Gomes | David Semedo | André Martins | João Magalhães
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Afonso Simplício | Gonçalo Vinagre | Miguel Moura Ramos | Diogo Tavares | Rafael Ferreira | Giuseppe Attanasio | Duarte M. Alves | Inês Calvo | Inês Vieira | Rui Guerra | James Furtado | Beatriz Canaverde | Iago Paulo | Vasco Ramos | Diogo Glória-Silva | Miguel Faria | Marcos Treviso | Daniel Gomes | Pedro Gomes | David Semedo | André Martins | João Magalhães
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Despite rapid progress in open large language models (LLMs), European Portuguese (pt-PT) remains underrepresented in both training data and native evaluation, with machine-translated benchmarks likely missing the variant’s linguistic and cultural nuances. We introduce AMALIA, a fully open LLM that prioritizes pt-PT by using more high-quality pt-PT data during both the mid- and post-training stages. To evaluate pt-PT more faithfully, we release a suite of pt-PT benchmarks that includes translated standard tasks and four new datasets targeting pt-PT generation, linguistic competence, and pt-PT/pt-BR bias. Experiments show that AMALIA matches strong baselines on translated benchmarks while substantially improving performance on pt-PT-specific evaluations, supporting the case for targeted training and native benchmarking for European Portuguese.
Search
Fix author
Co-authors
- André F. T. Martins 2
- Sweta Agrawal 1
- Tomás Almeida 1
- Duarte M. Alves 1
- Giuseppe Attanasio 1
- Filipe Azevedo 1
- Inês Calvo 1
- Beatriz Canaverde 1
- Miguel Faria 1
- Patrick Fernandes 1
- Rafael Ferreira 1
- James Furtado 1
- Diogo Glória-Silva 1
- Daniel Gomes 1
- Pedro Gomes 1
- Rui Guerra 1
- João Magalhães 1
- Iago Paulo 1
- Vasco Ramos 1
- David Semedo 1
- Afonso Simplício 1
- Diogo Tavares 1
- Marcos Treviso 1
- Daniel Vareta 1
- Inês Vieira 1
- Gonçalo Vinagre 1