Trading Complexity for Sparsity in Random Forest Explanations

Authors

  • Gilles Audemard CRIL-CNRS, U. Artois
  • Steve Bellart CRIL-CNRS, U. Artois
  • Louènas Bounia CRIL-CNRS, U. Artois
  • Frédéric Koriche CRIL-CNRS, U. Artois
  • Jean-Marie Lagniez CRIL-CNRS, U. Artois
  • Pierre Marquis CRIL-CNRS, U. Artois, IUF

DOI:

https://doi.org/10.1609/aaai.v36i5.20484

Keywords:

Knowledge Representation And Reasoning (KRR)

Abstract

Random forests have long been considered as powerful model ensembles in machine learning. By training multiple decision trees, whose diversity is fostered through data and feature subsampling, the resulting random forest can lead to more stable and reliable predictions than a single decision tree. This however comes at the cost of decreased interpretability: while decision trees are often easily interpretable, the predictions made by random forests are much more difficult to understand, as they involve a majority vote over multiple decision trees. In this paper, we examine different types of reasons that explain "why" an input instance is classified as positive or negative by a Boolean random forest. Notably, as an alternative to prime-implicant explanations taking the form of subset-minimal implicants of the random forest, we introduce majoritary reasons which are subset-minimal implicants of a strict majority of decision trees. For these abductive explanations, the tractability of the generation problem (finding one reason) and the optimization problem (finding one minimum-sized reason) are investigated. Unlike prime-implicant explanations, majoritary reasons may contain redundant features. However, in practice, prime-implicant explanations - for which the identification problem is DP-complete - are slightly larger than majoritary reasons that can be generated using a simple linear-time greedy algorithm. They are also significantly larger than minimum-sized majoritary reasons which can be approached using an anytime Partial MaxSAT algorithm.

Downloads

Published

2022-06-28

How to Cite

Audemard, G., Bellart, S., Bounia, L., Koriche, F., Lagniez, J.-M., & Marquis, P. (2022). Trading Complexity for Sparsity in Random Forest Explanations. Proceedings of the AAAI Conference on Artificial Intelligence, 36(5), 5461-5469. https://doi.org/10.1609/aaai.v36i5.20484

Issue

Section

AAAI Technical Track on Knowledge Representation and Reasoning