Interpretable and Differentially Private Predictions

Frederik Harder; Matthias Bauer; Mijung Park

doi:10.1609/aaai.v34i04.5827

Authors

Frederik Harder University of Tuebingen
Matthias Bauer Max Planck Institute for Intelligent Systems
Mijung Park University of Tuebingen

DOI:

https://doi.org/10.1609/aaai.v34i04.5827

Abstract

Interpretable predictions, which clarify why a machine learning model makes a particular decision, can compromise privacy by revealing the characteristics of individual data points. This raises the central question addressed in this paper: Can models be interpretable without compromising privacy? For complex “big” data fit by correspondingly rich models, balancing privacy and explainability is particularly challenging, such that this question has remained largely unexplored. In this paper, we propose a family of simple models with the aim of approximating complex models using several locally linear maps per class to provide high classification accuracy, as well as differentially private explanations on the classification. We illustrate the usefulness of our approach on several image benchmark datasets as well as a medical dataset.

Interpretable and Differentially Private Predictions

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription