Revisiting the Importance of Amplifying Bias for Debiasing

Authors

  • Jungsoo Lee Graduate School of Artificial Intelligence, KAIST Kakao Enterprise
  • Jeonghoon Park Korea Advanced Institute of Science and Technology Kakao Enterprise
  • Daeyoung Kim Korea Advanced Institute of Science and Technology
  • Juyoung Lee Kakao Enterprise
  • Edward Choi KAIST
  • Jaegul Choo Korea Advanced Institute of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v37i12.26748

Keywords:

General

Abstract

In image classification, debiasing aims to train a classifier to be less susceptible to dataset bias, the strong correlation between peripheral attributes of data samples and a target class. For example, even if the frog class in the dataset mainly consists of frog images with a swamp background (i.e., bias aligned samples), a debiased classifier should be able to correctly classify a frog at a beach (i.e., bias conflicting samples). Recent debiasing approaches commonly use two components for debiasing, a biased model fB and a debiased model fD. fB is trained to focus on bias aligned samples (i.e., overfitted to the bias) while fD is mainly trained with bias conflicting samples by concentrating on samples which fB fails to learn, leading fD to be less susceptible to the dataset bias. While the state of the art debiasing techniques have aimed to better train fD, we focus on training fB, an overlooked component until now. Our empirical analysis reveals that removing the bias conflicting samples from the training set for fB is important for improving the debiasing performance of fD. This is due to the fact that the bias conflicting samples work as noisy samples for amplifying the bias for fB since those samples do not include the bias attribute. To this end, we propose a simple yet effective data sample selection method which removes the bias conflicting samples to construct a bias amplified dataset for training fB. Our data sample selection method can be directly applied to existing reweighting based debiasing approaches, obtaining consistent performance boost and achieving the state of the art performance on both synthetic and real-world datasets.

Downloads

Published

2023-06-26

How to Cite

Lee, J., Park, J., Kim, D., Lee, J., Choi, E., & Choo, J. (2023). Revisiting the Importance of Amplifying Bias for Debiasing. Proceedings of the AAAI Conference on Artificial Intelligence, 37(12), 14974-14981. https://doi.org/10.1609/aaai.v37i12.26748

Issue

Section

AAAI Special Track on Safe and Robust AI