Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests

Authors

  • Kristin Blesch Leibniz Institute for Prevention Research and Epidemiology – BIPS Faculty of Mathematics and Computer Science, University of Bremen
  • Niklas Koenen Leibniz Institute for Prevention Research and Epidemiology – BIPS Faculty of Mathematics and Computer Science, University of Bremen
  • Jan Kapar Leibniz Institute for Prevention Research and Epidemiology – BIPS Faculty of Mathematics and Computer Science, University of Bremen
  • Pegah Golchian Leibniz Institute for Prevention Research and Epidemiology – BIPS Faculty of Mathematics and Computer Science, University of Bremen
  • Lukas Burk Leibniz Institute for Prevention Research and Epidemiology – BIPS Faculty of Mathematics and Computer Science, University of Bremen
  • Markus Loecher Department of Business and Economics, Berlin School of Economics and Law
  • Marvin N. Wright Leibniz Institute for Prevention Research and Epidemiology – BIPS Faculty of Mathematics and Computer Science, University of Bremen Department of Public Health, University of Copenhagen

DOI:

https://doi.org/10.1609/aaai.v39i15.33712

Abstract

This paper proposes a method for measuring conditional feature importance via generative modeling. In explainable artificial intelligence (XAI), conditional feature importance assesses the impact of a feature on a prediction model's performance given the information of other features. Model-agnostic post hoc methods to do so typically evaluate changes in the predictive performance under on-manifold feature value manipulations. Such procedures require creating feature values that respect conditional feature distributions, which can be challenging in practice. Recent advancements in generative modeling can facilitate this. For tabular data, which may consist of both categorical and continuous features, the adversarial random forest (ARF) stands out as a generative model that can generate on-manifold data points without requiring intensive tuning efforts or computational resources, making it a promising candidate model for subroutines in XAI methods. This paper proposes cARFi (conditional ARF feature importance), a method for measuring conditional feature importance through feature values sampled from ARF-estimated conditional distributions. cARFi requires only little tuning to yield robust importance scores that can flexibly adapt for conditional or marginal notions of feature importance, including straightforward extensions to condition on feature subsets and allows for inferring the significance of feature importances through statistical tests.

Published

2025-04-11

How to Cite

Blesch, K., Koenen, N., Kapar, J., Golchian, P., Burk, L., Loecher, M., & Wright, M. N. (2025). Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests. Proceedings of the AAAI Conference on Artificial Intelligence, 39(15), 15596–15604. https://doi.org/10.1609/aaai.v39i15.33712

Issue

Section

AAAI Technical Track on Machine Learning I