Evaluation Cards for XAI Metrics

Rokas Gipiškis, Olga Kurasova


Abstract
The evaluation of explainable AI (XAI) methods is affected by a lack of standardization. Metrics are inconsistently defined, incompletely reported, and rarely validated against common baselines. In this paper, we identify transparency of evaluation reporting as a central, under-addressed problem. We propose the XAI Evaluation Card, a documentation template analogous to model cards, designed to accompany any study that introduces an XAI evaluation metric. The card covers explicit declaration of target properties, grounding levels, metric assumptions, validation evidence, gaming risks, and known failure cases. We argue that adopting this template as a community norm would reduce evaluation fragmentation, support meta-analysis, and improve accountability in XAI research.
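As a rough illustration of the sections the abstract lists, such a card could be represented as a small structured record. This is only a sketch under stated assumptions: the class name `XAIEvaluationCard`, the field names, the `missing_fields` helper, and the example values are all hypothetical and are not taken from the paper, whose actual template may be organized differently.

```python
from __future__ import annotations

from dataclasses import asdict, dataclass, field


@dataclass
class XAIEvaluationCard:
    """Hypothetical container mirroring the sections named in the abstract."""

    metric_name: str                      # assumed identifier for the metric being documented
    target_properties: list[str]          # properties the metric claims to measure
    grounding_level: str                  # free text; the paper's own taxonomy is not assumed here
    assumptions: list[str] = field(default_factory=list)
    validation_evidence: list[str] = field(default_factory=list)
    gaming_risks: list[str] = field(default_factory=list)
    known_failure_cases: list[str] = field(default_factory=list)

    def missing_fields(self) -> list[str]:
        """Return the names of sections that are still empty."""
        return [name for name, value in asdict(self).items() if not value]


# Illustrative values only; none of them come from the paper.
card = XAIEvaluationCard(
    metric_name="example-metric",
    target_properties=["faithfulness"],
    grounding_level="placeholder grounding level",
)
print(card.missing_fields())
```

A completeness check like `missing_fields` is one way a venue or reviewer might flag incompletely reported metrics, though whether the authors intend any such automated check is not stated in the abstract.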
Anthology ID:
2026.evaleval-1.39
Volume:
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Month:
July
Year:
2026
Address:
San Diego, CA
Editors:
Mubashara Akhtar, Jan Batzner, Leshem Choshen, Avijit Ghosh, Usman Gohar, Jennifer Mickel, Ichhya Pant, Zeerak Talat, Michelle Lin
Venues:
EvalEval | WS
Publisher:
Association for Computational Linguistics
Pages:
245–251
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.evaleval-1.39/
Cite (ACL):
Rokas Gipiškis and Olga Kurasova. 2026. Evaluation Cards for XAI Metrics. In Proceedings of the Workshop on Evaluating Evaluations (EvalEval), pages 245–251, San Diego, CA. Association for Computational Linguistics.
Cite (Informal):
Evaluation Cards for XAI Metrics (Gipiškis & Kurasova, EvalEval 2026)
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.evaleval-1.39.pdf