When Equal Isn’t Fair: Mitigating Over-Normalization in Large Language Models (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v40i48.42279Abstract
Bias in Large Language Models (LLMs) is increasingly addressed through fairness-oriented techniques. However, in some cases, these approaches may inadvertently remove genuine cultural differences between groups, leading to “over-normalization” or models losing important socio-cultural distinctions. In this work, we introduce OverNormEval, a benchmark designed to detect when an LLM exhibits such over-normalization. We further explore the use of Direct Preference Optimization (DPO) to mitigate over-normalization.Downloads
Published
2026-03-14
How to Cite
Satyadev, R., Ganesh Kumar, A., Anand, A., Shah, R. R., Wang, Z., & Prasad, M. (2026). When Equal Isn’t Fair: Mitigating Over-Normalization in Large Language Models (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41384–41386. https://doi.org/10.1609/aaai.v40i48.42279
Issue
Section
AAAI Student Abstract and Poster Program