Formal Abductive Latent Explanations for Prototype-Based Networks

Jules Soria; Zakaria Chihani; Julien Girard-Satabin; Alban Grastien; Romain Xu-Darme; Daniela Cancila

doi:10.1609/aaai.v40i30.39755

Authors

Jules Soria Université Paris-Saclay, CEA, List
Zakaria Chihani Université Paris-Saclay, CEA, List
Julien Girard-Satabin Université Paris-Saclay, CEA, List
Alban Grastien Université Paris-Saclay, CEA, List
Romain Xu-Darme Université Paris-Saclay, CEA, List
Daniela Cancila Université Paris-Saclay, CEA, List

DOI:

https://doi.org/10.1609/aaai.v40i30.39755

Abstract

Case-based reasoning networks are machine-learning models that make predictions based on similarity between the input and prototypical parts of training samples, called prototypes. Such models are able to explain each decision by pointing to the prototypes that contributed the most to the final outcome. As the explanation is a core part of the prediction, they are often qualified as "interpretable by design". While promising, we show that such explanations are sometimes misleading, which hampers their usefulness in safety-critical contexts. In particular, several instances may lead to different predictions and yet have the same explanation. Drawing inspiration from the field of formal eXplainable AI (formal XAI), we propose Abductive Latent Explanations (ALEs), a formalism to express sufficient conditions on the intermediate (latent) representation of the instance that imply the prediction. Our approach combines the inherent interpretability of case-based reasoning models and the guarantees provided by formal XAI. We propose a solver-free and scalable algorithm for generating ALEs based on three distinct paradigms, compare them, and present the feasibility of our approach on diverse datasets for both standard and fine-grained image classification.

Formal Abductive Latent Explanations for Prototype-Based Networks

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information