@inproceedings{bonnier-2025-error,
    title = "Error Detection for Multimodal Classification",
    author = "Bonnier, Thomas",
    editor = "Cao, Trista  and
      Das, Anubrata  and
      Kumarage, Tharindu  and
      Wan, Yixin  and
      Krishna, Satyapriya  and
      Mehrabi, Ninareh  and
      Dhamala, Jwala  and
      Ramakrishna, Anil  and
      Galstyan, Aram  and
      Kumar, Anoop  and
      Gupta, Rahul  and
      Chang, Kai-Wei",
    booktitle = "Proceedings of the 5th Workshop on Trustworthy NLP (TrustNLP 2025)",
    month = may,
    year = "2025",
    address = "Albuquerque, New Mexico",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.trustnlp-main.6/",
    doi = "10.18653/v1/2025.trustnlp-main.6",
    pages = "66--81",
    ISBN = "979-8-89176-233-6",
    abstract = "Machine learning models have proven useful in key applications such as autonomous driving and diagnosis prediction. When a model is deployed under real-world conditions, it is therefore essential to detect potential errors with a trustworthy approach. This monitoring practice makes decision-making safer by avoiding catastrophic failures. In this paper, the focus is on multimodal classification. We introduce a method that addresses error detection based on unlabeled data. It leverages fused representations and computes the probability that a model will fail based on detected fault patterns in validation data. To improve transparency, we employ a sampling-based approximation of Shapley values in multimodal settings in order to explain why a prediction is assessed as erroneous in terms of feature values. Further, as explanation methods can sometimes disagree, we suggest evaluating the consistency of explanations produced by different value functions and algorithms. To show the relevance of our method, we evaluate it against 9 baselines from various domains on tabular-text and text-image datasets, using 2 multimodal fusion strategies for the classification models. Lastly, we show the usefulness of our explanation algorithm on misclassified samples."
}