Trading Complexity for Sparsity in Random Forest Explanations

Gilles Audemard; Steve Bellart; Louènas Bounia; Frédéric Koriche; Jean-Marie Lagniez; Pierre Marquis

doi:10.1609/aaai.v36i5.20484

Authors

Gilles Audemard CRIL-CNRS, U. Artois
Steve Bellart CRIL-CNRS, U. Artois
Louènas Bounia CRIL-CNRS, U. Artois
Frédéric Koriche CRIL-CNRS, U. Artois
Jean-Marie Lagniez CRIL-CNRS, U. Artois
Pierre Marquis CRIL-CNRS, U. Artois, IUF

DOI:

https://doi.org/10.1609/aaai.v36i5.20484

Keywords:

Knowledge Representation And Reasoning (KRR)

Abstract

Random forests have long been considered as powerful model ensembles in machine learning. By training multiple decision trees, whose diversity is fostered through data and feature subsampling, the resulting random forest can lead to more stable and reliable predictions than a single decision tree. This however comes at the cost of decreased interpretability: while decision trees are often easily interpretable, the predictions made by random forests are much more difficult to understand, as they involve a majority vote over multiple decision trees. In this paper, we examine different types of reasons that explain "why" an input instance is classified as positive or negative by a Boolean random forest. Notably, as an alternative to prime-implicant explanations taking the form of subset-minimal implicants of the random forest, we introduce majoritary reasons which are subset-minimal implicants of a strict majority of decision trees. For these abductive explanations, the tractability of the generation problem (finding one reason) and the optimization problem (finding one minimum-sized reason) are investigated. Unlike prime-implicant explanations, majoritary reasons may contain redundant features. However, in practice, prime-implicant explanations - for which the identification problem is DP-complete - are slightly larger than majoritary reasons that can be generated using a simple linear-time greedy algorithm. They are also significantly larger than minimum-sized majoritary reasons which can be approached using an anytime Partial MaxSAT algorithm.

Trading Complexity for Sparsity in Random Forest Explanations

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription