Granger-Causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks

Patrick Schwab; Djordje Miladinovic; Walter Karlen

doi:10.1609/aaai.v33i01.33014846

Granger-Causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks

Authors

Patrick Schwab ETH Zurich
Djordje Miladinovic ETH Zurich
Walter Karlen ETH Zurich

DOI:

https://doi.org/10.1609/aaai.v33i01.33014846

Abstract

Knowledge of the importance of input features towards decisions made by machine-learning models is essential to increase our understanding of both the models and the underlying data. Here, we present a new approach to estimating feature importance with neural networks based on the idea of distributing the features of interest among experts in an attentive mixture of experts (AME). AMEs use attentive gating networks trained with a Granger-causal objective to learn to jointly produce accurate predictions as well as estimates of feature importance in a single model. Our experiments show (i) that the feature importance estimates provided by AMEs compare favourably to those provided by state-of-theart methods, (ii) that AMEs are significantly faster at estimating feature importance than existing methods, and (iii) that the associations discovered by AMEs are consistent with those reported by domain experts.

Downloads

Published

2019-07-17

How to Cite

Schwab, P., Miladinovic, D., & Karlen, W. (2019). Granger-Causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 4846–4853. https://doi.org/10.1609/aaai.v33i01.33014846

Download Citation

Issue

Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20

Section

AAAI Technical Track: Machine Learning

Granger-Causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information