Rethinking Bias in Generative Data Augmentation for Medical AI: A Frequency Recalibration Method

Authors

  • Chi Liu City University of Macau
  • Jincheng Liu City University of Macau
  • Congcong Zhu City University of Macau
  • Minghao Wang City University of Macau
  • Sheng Shen Torrens University Australia
  • Jia Gu City University of Macau
  • Tianqing Zhu City University of Macau
  • Wanlei Zhou City University of Macau

DOI:

https://doi.org/10.1609/aaai.v40i9.37644

Abstract

Developing Medical AI relies on large datasets and easily suffers from data scarcity. Generative data augmentation (GDA) using AI generative models offers a solution to synthesize realistic medical images. However, the bias in GDA is often underestimated in medical domains, with concerns about the risk of introducing detrimental features generated by AI and harming downstream tasks. This paper identifies the frequency misalignment between real and synthesized images as one of the key factors underlying unreliable GDA and proposes the Frequency Recalibration (FreRec) method to reduce the frequency distributional discrepancy and thus improve GDA. FreRec involves (1) Statistical High-frequency Replacement (SHR) to roughly align high-frequency components and (2) Reconstructive High-frequency Mapping (RHM) to enhance image quality and reconstruct high-frequency details. Extensive experiments were conducted in various medical datasets, including brain MRIs, chest X-rays, and fundus images. The results show that FreRec significantly improves downstream medical image classification performance compared to uncalibrated AI-synthesized samples. FreRec is a standalone post-processing step that is compatible with any generative model and can integrate seamlessly with common medical GDA pipelines.

Downloads

Published

2026-03-14

How to Cite

Liu, C., Liu, J., Zhu, C., Wang, M., Shen, S., Gu, J., … Zhou, W. (2026). Rethinking Bias in Generative Data Augmentation for Medical AI: A Frequency Recalibration Method. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 7087–7095. https://doi.org/10.1609/aaai.v40i9.37644

Issue

Section

AAAI Technical Track on Computer Vision VI