Hacking Neural Evaluation Metrics with Single Text

Hiroyuki Deguchi; Katsuki Chousa; Yusuke Sakai

Abstract

Strongly human-correlated evaluation metrics serve as an essential compass for the development and improvement of generation models and must be highly reliable and robust. Recent embedding-based neural text evaluation metrics, such as COMET for translation tasks, are widely used in both research and development fields. However, there is no guarantee that they yield reliable evaluation results due to the black-box nature of neural networks. To raise concerns about the reliability and safety of such metrics, we propose a method for finding a single adversarial text in the discrete space that is consistently evaluated as high-quality, regardless of the test cases, to identify the vulnerabilities in evaluation metrics. The single hub text found with our method achieved 79.1 COMET% and 67.8 COMET% in the WMT’24 English-to-Japanese (En–Ja) and English-to-German (En–De) translation tasks, respectively, outperforming translations generated individually for each source sentence by using M2M100, a general translation model. Furthermore, we also confirmed that the hub text found with our method generalizes across multiple language pairs such as Ja–En and De–En.

Anthology ID: 2026.eacl-short.13
Volume: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers)
Month: March
Year: 2026
Address: Rabat, Morocco
Editors: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue: EACL
Publisher: Association for Computational Linguistics
Pages: 198–206
URL: https://preview.aclanthology.org/ingest-eacl/2026.eacl-short.13/
Cite (ACL): Hiroyuki Deguchi, Katsuki Chousa, and Yusuke Sakai. 2026. Hacking Neural Evaluation Metrics with Single Text. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers), pages 198–206, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal): Hacking Neural Evaluation Metrics with Single Text (Deguchi et al., EACL 2026)
PDF: https://preview.aclanthology.org/ingest-eacl/2026.eacl-short.13.pdf
Checklist: 2026.eacl-short.13.checklist.pdf
