LUCID: Exposing Algorithmic Bias through Inverse Design

Authors

  • Carmen Mazijn, Vrije Universiteit Brussel
  • Carina Prunkl, University of Oxford
  • Andres Algaba, Vrije Universiteit Brussel
  • Jan Danckaert, Vrije Universiteit Brussel
  • Vincent Ginis, Vrije Universiteit Brussel & Harvard University

DOI:

https://doi.org/10.1609/aaai.v37i12.26683

Keywords:

General

Abstract

AI systems can create, propagate, support, and automate bias in decision-making processes. To mitigate biased decisions, we need both to understand the origin of the bias and to define what it means for an algorithm to make fair decisions. Most group fairness notions assess a model's equality of outcome by computing statistical metrics on the outputs. We argue that these output metrics encounter intrinsic obstacles and present a complementary approach that aligns with the increasing focus on equality of treatment. By Locating Unfairness through Canonical Inverse Design (LUCID), we generate a canonical set that shows the desired inputs for a model given a preferred output. The canonical set reveals the model's internal logic and exposes potential unethical biases by repeatedly interrogating the decision-making process. We evaluate LUCID on the UCI Adult and COMPAS data sets and find that some biases detected by a canonical set differ from those of output metrics. The results show that by shifting the focus towards equality of treatment and looking into the algorithm's internal workings, the canonical sets are a valuable addition to the toolbox of algorithmic fairness evaluation.
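To make the inverse-design idea in the abstract concrete, the sketch below shows one plausible way to generate a canonical set: freeze a trained classifier and optimise candidate inputs by gradient descent so that the model assigns them the preferred (positive) outcome. The toy architecture, feature count, and hyperparameters are placeholder assumptions for illustration, not the authors' implementation, which also includes data handling and group-wise analysis of the resulting canonical inputs.

    # Hedged sketch of canonical-set generation via inverse design (assumed setup).
    import torch
    import torch.nn as nn

    torch.manual_seed(0)

    # Toy binary classifier over 5 tabular features (placeholder architecture,
    # standing in for a model trained on e.g. UCI Adult or COMPAS).
    model = nn.Sequential(nn.Linear(5, 16), nn.ReLU(), nn.Linear(16, 1), nn.Sigmoid())

    # Pretend the model is already trained; freeze its parameters.
    for p in model.parameters():
        p.requires_grad_(False)

    # Optimise a batch of candidate inputs, not the model weights.
    canonical = torch.randn(32, 5, requires_grad=True)   # 32 canonical samples
    optimizer = torch.optim.Adam([canonical], lr=0.05)
    target = torch.ones(32, 1)                           # preferred output: "accept"

    for step in range(200):
        optimizer.zero_grad()
        # Push each candidate's predicted outcome towards the preferred decision.
        loss = nn.functional.binary_cross_entropy(model(canonical), target)
        loss.backward()
        optimizer.step()

    # The optimised inputs form a canonical set: inspecting how (proxies for)
    # protected attributes are distributed across them hints at which feature
    # values the model implicitly favours for a positive decision.
    print(canonical.detach())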

Published

2023-06-26

How to Cite

Mazijn, C., Prunkl, C., Algaba, A., Danckaert, J., & Ginis, V. (2023). LUCID: Exposing Algorithmic Bias through Inverse Design. Proceedings of the AAAI Conference on Artificial Intelligence, 37(12), 14391-14399. https://doi.org/10.1609/aaai.v37i12.26683

Section

AAAI Special Track on AI for Social Impact