A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations

Lingjun Zhao, Hal Daumé Iii


Abstract
Faithful free-text explanations are important to ensure transparency in high-stakes AI decision-making contexts, but they are challenging to generate by language models and assess by humans. In this paper, we present a measure for Prediction-EXplanation (PEX) consistency, by extending the concept of weight of evidence. This measure quantifies how much a free-text explanation supports or opposes a prediction, serving as an important aspect of explanation faithfulness. Our analysis reveals that more than 62% explanations generated by large language models lack this consistency. We show that applying direct preference optimization improves the consistency of generated explanations across three model families, with improvement ranging from 43.1% to 292.3%. Furthermore, we demonstrate that optimizing this consistency measure can improve explanation faithfulness by up to 9.7%.
Anthology ID:
2025.emnlp-main.797
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
15810–15824
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.797/
DOI:
Bibkey:
Cite (ACL):
Lingjun Zhao and Hal Daumé Iii. 2025. A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 15810–15824, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations (Zhao & Iii, EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.797.pdf
Checklist:
 2025.emnlp-main.797.checklist.pdf