REGLO: Provable Neural Network Repair for Global Robustness Properties

Feisi Fu; Zhilu Wang; Weichao Zhou; Yixuan Wang; Jiameng Fan; Chao Huang; Qi Zhu; Xin Chen; Wenchao Li

doi:10.1609/aaai.v38i11.29094

Authors

Feisi Fu Boston University
Zhilu Wang Northwestern University
Weichao Zhou Boston University
Yixuan Wang Northwestern University
Jiameng Fan Boston University
Chao Huang University of Liverpool
Qi Zhu Northwestern University
Xin Chen University of New Mexico
Wenchao Li Boston University

DOI:

https://doi.org/10.1609/aaai.v38i11.29094

Keywords:

ML: Ethics, Bias, and Fairness; ML: Adversarial Learning & Robustness; ML: Privacy

Abstract

We present REGLO, a novel methodology for repairing pretrained neural networks to satisfy global robustness and individual fairness properties. A neural network is said to be globally robust with respect to a given input region if and only if all the input points in the region are locally robust. This notion of global robustness also captures the notion of individual fairness as a special case. We prove that any counterexample to a global robustness property must exhibit a corresponding large gradient. For ReLU networks, this result allows us to efficiently identify the linear regions that violate a given global robustness property. By formulating and solving a suitable robust convex optimization problem, REGLO then computes a minimal weight change that will provably repair these violating linear regions.
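
The link between counterexamples and large gradients can be illustrated on a toy ReLU network. The sketch below is not REGLO's algorithm. It assumes a global robustness property of the form "inputs within distance delta produce outputs within distance eps", measured in the L2 norm; the symbols `eps` and `delta`, the choice of norm, and the helper names `region_jacobian` and `violates_gradient_bound` are all assumptions introduced here for illustration. Under that assumption, the mean value inequality implies that any violating pair of inputs is joined by a segment that crosses a linear region whose Jacobian norm exceeds eps/delta, so inspecting per-region Jacobians is one plausible way to flag candidate regions.

```python
import numpy as np

def region_jacobian(weights, biases, x):
    """Jacobian of a ReLU MLP on the linear region that contains x.

    Hypothetical helper: hidden layers use ReLU, the last layer is linear.
    """
    J = np.eye(x.shape[0])
    a = x
    last = len(weights) - 1
    for i, (W, b) in enumerate(zip(weights, biases)):
        z = W @ a + b
        J = W @ J
        if i < last:
            # The activation pattern fixes which neurons are active,
            # and therefore which linear region x belongs to.
            mask = (z > 0).astype(z.dtype)
            a = z * mask
            J = mask[:, None] * J
        else:
            a = z
    return J

def violates_gradient_bound(weights, biases, x, eps, delta):
    """True if the region containing x has Jacobian spectral norm > eps / delta."""
    J = region_jacobian(weights, biases, x)
    return np.linalg.norm(J, 2) > eps / delta

# Toy 2-2-1 network (assumed weights, chosen for easy arithmetic).
weights = [np.array([[1.0, 0.0], [0.0, 1.0]]), np.array([[3.0, 4.0]])]
biases = [np.zeros(2), np.zeros(1)]

x_pos = np.array([1.0, 1.0])    # both hidden neurons active
x_neg = np.array([-1.0, -1.0])  # both hidden neurons inactive

J_pos = region_jacobian(weights, biases, x_pos)
J_neg = region_jacobian(weights, biases, x_neg)
```

In this toy case the region containing `x_pos` has Jacobian `[[3, 4]]` with spectral norm 5, which exceeds the threshold eps/delta = 2 for `eps=1.0`, `delta=0.5`, while the region containing `x_neg` has a zero Jacobian and cannot contain a violation under the stated assumption.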
