How Reasoning Influences Intersectional Biases in Vision–Language Models (Student Abstract)

Authors

  • Adit Desai Jio Institute
  • Sudipta Roy Jio Institute
  • Mohna Chakraborty Jio Institute

DOI:

https://doi.org/10.1609/aaai.v40i48.42208

Abstract

Vision-Language Models (VLMs) are increasingly deployed across downstream tasks, yet their training data often encode social biases that surface in outputs. Unlike humans, who interpret images through contextual and social cues, VLMs process them through statistical associations, often leading to reasoning that diverges from human reasoning. By analyzing how a VLM reasons, we can understand how inherent biases are perpetuated and can adversely affect downstream performance. To examine this gap, we systematically analyze social biases in five open-source VLMs for an occupation prediction task, on the FairFace dataset. Across 32 occupations and three different prompting styles, we elicit both predictions and reasoning. Our findings show that the biased reasoning patterns systematically underlie intersectional disparities, highlighting the need to align VLM reasoning with human values before downstream deployment.

Downloads

Published

2026-03-14

How to Cite

Desai, A., Roy, S., & Chakraborty, M. (2026). How Reasoning Influences Intersectional Biases in Vision–Language Models (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41185–41187. https://doi.org/10.1609/aaai.v40i48.42208