Boosting Contrastive Learning with Relation Knowledge Distillation

Authors

  • Kai Zheng Megvii Technology
  • Yuanjiang Wang Megvii Technology
  • Ye Yuan Megvii Technology

DOI:

https://doi.org/10.1609/aaai.v36i3.20262

Keywords:

Computer Vision (CV), Machine Learning (ML)

Abstract

While self-supervised representation learning (SSL) has proved to be effective in the large model, there is still a huge gap between the SSL and supervised method in the lightweight model when following the same solution. We delve into this problem and find that the lightweight model is prone to collapse in semantic space when simply performing instance-wise contrast. To address this issue, we propose a relation-wise contrastive paradigm with Relation Knowledge Distillation (ReKD). We introduce a heterogeneous teacher to explicitly mine the semantic information and transferring a novel relation knowledge to the student (lightweight model). The theoretical analysis supports our main concern about instance-wise contrast and verify the effectiveness of our relation-wise contrastive learning. Extensive experimental results also demonstrate that our method achieves significant improvements on multiple lightweight models. Particularly, the linear evaluation on AlexNet obviously improves the current state-of-art from 44.7% to 50.1% , which is the first work to get close to the supervised (50.5%). Code will be made available.

Downloads

Published

2022-06-28

How to Cite

Zheng, K., Wang, Y., & Yuan, Y. (2022). Boosting Contrastive Learning with Relation Knowledge Distillation. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3508-3516. https://doi.org/10.1609/aaai.v36i3.20262

Issue

Section

AAAI Technical Track on Computer Vision III