Alex Terentowicz
2025
How (un)faithful are explainable LLM-based NLG metrics?
Alex Terentowicz
|
Mateusz Lango
|
Ondrej Dusek
Proceedings of the 18th International Natural Language Generation Conference
Explainable NLG metrics are becoming a popular research topic; however, the faithfulness of the explanations they provide is typically not evaluated. In this work, we propose a testbed for assessing the faithfulness of span-based metrics by performing controlled perturbations of their explanations and observing changes in the final score. We show that several popular LLM evaluators do not consistently produce faithful explanations.