The Silent Amplifier: In-Context Examples Fuel Bias in Large Language Models
DOI:
https://doi.org/10.1609/aaai.v40i42.40856Abstract
In-context learning (ICL) has proven to be adept at adapting large language models (LLMs) to downstream tasks without parameter updates, based on a few demonstration examples. Prior work has found that the ICL performance is susceptible to the selection of examples in prompt and made efforts to stabilize it. However, existing example selection studies ignore the ethical risks behind the examples selected, such as gender and race bias. In this work, we conduct extensive experiments and discover that (1) example selection with high accuracy does not mean low bias; (2) example selection for ICL may amplify the biases of LLMs; (3) example selection contributes to spurious correlations of LLMs. Based on the above observations, we propose the Remind with Bias-aware Embedding (ReBE), which removes the spurious correlations through contrastive learning and obtains bias-aware embedding for LLMs based on prompt tuning. Finally, we demonstrate that ReBE effectively mitigates biases of LLMs without significantly compromising accuracy and is highly compatible with existing example selection methods.Downloads
Published
2026-03-14
How to Cite
Guo, X., Gao, J., Zhou, J., Zhang, J., Liu, Q., Wu, H., … Wei, X. (2026). The Silent Amplifier: In-Context Examples Fuel Bias in Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(42), 35464–35472. https://doi.org/10.1609/aaai.v40i42.40856
Issue
Section
AAAI Technical Track on Philosophy and Ethics of AI