The Silent Amplifier: In-Context Examples Fuel Bias in Large Language Models

Xinwei Guo; Jiashi Gao; Junlei Zhou; Jiaxin Zhang; Quanying Liu; Haiyan Wu; Xin Yao; Xuetao Wei

doi:10.1609/aaai.v40i42.40856

Authors

Xinwei Guo Southern University of Science and Technology
Jiashi Gao Southern University of Science and Technology
Junlei Zhou Southern University of Science and Technology
Jiaxin Zhang Southern University of Science and Technology
Quanying Liu Southern University of Science and Technology
Haiyan Wu University of Macau
Xin Yao Lingnan University
Xuetao Wei Southern University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v40i42.40856

Abstract

In-context learning (ICL) has proven to be adept at adapting large language models (LLMs) to downstream tasks without parameter updates, based on a few demonstration examples. Prior work has found that the ICL performance is susceptible to the selection of examples in prompt and made efforts to stabilize it. However, existing example selection studies ignore the ethical risks behind the examples selected, such as gender and race bias. In this work, we conduct extensive experiments and discover that (1) example selection with high accuracy does not mean low bias; (2) example selection for ICL may amplify the biases of LLMs; (3) example selection contributes to spurious correlations of LLMs. Based on the above observations, we propose the Remind with Bias-aware Embedding (ReBE), which removes the spurious correlations through contrastive learning and obtains bias-aware embedding for LLMs based on prompt tuning. Finally, we demonstrate that ReBE effectively mitigates biases of LLMs without significantly compromising accuracy and is highly compatible with existing example selection methods.

The Silent Amplifier: In-Context Examples Fuel Bias in Large Language Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information