Formal Abductive Latent Explanations for Prototype-Based Networks

Authors

  • Jules Soria Université Paris-Saclay, CEA, List
  • Zakaria Chihani Université Paris-Saclay, CEA, List
  • Julien Girard-Satabin Université Paris-Saclay, CEA, List
  • Alban Grastien Université Paris-Saclay, CEA, List
  • Romain Xu-Darme Université Paris-Saclay, CEA, List
  • Daniela Cancila Université Paris-Saclay, CEA, List

DOI:

https://doi.org/10.1609/aaai.v40i30.39755

Abstract

Case-based reasoning networks are machine-learning models that make predictions based on similarity between the input and prototypical parts of training samples, called prototypes. Such models are able to explain each decision by pointing to the prototypes that contributed the most to the final outcome. As the explanation is a core part of the prediction, they are often qualified as "interpretable by design". While promising, we show that such explanations are sometimes misleading, which hampers their usefulness in safety-critical contexts. In particular, several instances may lead to different predictions and yet have the same explanation. Drawing inspiration from the field of formal eXplainable AI (formal XAI), we propose Abductive Latent Explanations (ALEs), a formalism to express sufficient conditions on the intermediate (latent) representation of the instance that imply the prediction. Our approach combines the inherent interpretability of case-based reasoning models and the guarantees provided by formal XAI. We propose a solver-free and scalable algorithm for generating ALEs based on three distinct paradigms, compare them, and present the feasibility of our approach on diverse datasets for both standard and fine-grained image classification.

Published

2026-03-14

How to Cite

Soria, J., Chihani, Z., Girard-Satabin, J., Grastien, A., Xu-Darme, R., & Cancila, D. (2026). Formal Abductive Latent Explanations for Prototype-Based Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 40(30), 25590–25598. https://doi.org/10.1609/aaai.v40i30.39755

Issue

Section

AAAI Technical Track on Machine Learning VII