Enhancing Contrastive Learning Inspired by the Philosophy of “The Blind Men and the Elephant”

Authors

  • Yudong Zhang Tsinghua University Tencent
  • Ruobing Xie Tencent
  • Jiansheng Chen University of Science and Technology Beijing
  • Xingwu Sun Tencent University of Macau
  • Zhanhui Kang Tencent
  • Yu Wang Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v39i21.34425

Abstract

Contrastive learning is a prevalent technique in self-supervised vision representation learning, typically generating positive pairs by applying two data augmentations to the same image. Designing effective data augmentation strategies is crucial for the success of contrastive learning. Inspired by the story of the blind men and the elephant, we introduce JointCrop and JointBlur. These methods generate more challenging positive pairs by leveraging the joint distribution of the two augmentation parameters, thereby enabling contrastive learning to acquire more effective feature representations. To the best of our knowledge, this is the first effort to explicitly incorporate the joint distribution of two data augmentation parameters into contrastive learning. As a plug-and-play framework without additional computational overhead, JointCrop and JointBlur enhance the performance of SimCLR, BYOL, MoCo v1, MoCo v2, MoCo v3, SimSiam, and Dino baselines with notable improvements.

Downloads

Published

2025-04-11

How to Cite

Zhang, Y., Xie, R., Chen, J., Sun, X., Kang, Z., & Wang, Y. (2025). Enhancing Contrastive Learning Inspired by the Philosophy of “The Blind Men and the Elephant”. Proceedings of the AAAI Conference on Artificial Intelligence, 39(21), 22659–22667. https://doi.org/10.1609/aaai.v39i21.34425

Issue

Section

AAAI Technical Track on Machine Learning VII