Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing

Authors

  • Zhili LIU Huawei Noah's Ark Lab HKUST
  • Jianhua Han Huawei Noah's Ark Lab
  • Lanqing Hong Huawei Noah's Ark Lab
  • Hang Xu Huawei Noah's Ark Lab
  • Kai Chen HKUST
  • Chunjing Xu Huawei Noah's Ark Lab
  • Zhenguo Li Huawei Noah's Ark Lab

DOI:

https://doi.org/10.1609/aaai.v36i2.20079

Keywords:

Computer Vision (CV)

Abstract

Self-supervised learning (SSL), especially contrastive methods, has raised attraction recently as it learns effective transferable representations without semantic annotations. A common practice for self-supervised pre-training is to use as much data as possible. For a specific downstream task, however, involving irrelevant data in pre-training may degenerate the downstream performance, observed from our extensive experiments. On the other hand, for existing SSL methods, it is burdensome and infeasible to use different downstream-task-customized datasets in pre-training for different tasks. To address this issue, we propose a novel SSL paradigm called Scalable Dynamic Routing (SDR), which can be trained once and deployed efficiently to different downstream tasks with task-customized pre-trained models. Specifically, we construct the SDRnet with various sub-nets and train each sub-net with only one subset of the data by data-aware progressive training. When a downstream task arrives, we route among all the pre-trained sub-nets to get the best along with its corresponding weights. Experiment results show that our SDR can train 256 sub-nets on ImageNet simultaneously, which provides better transfer performance than a unified model trained on the full ImageNet, achieving state-of-the-art (SOTA) averaged accuracy over 11 downstream classification tasks and AP on PASCAL VOC detection task.

Downloads

Published

2022-06-28

How to Cite

LIU, Z., Han, J., Hong, L., Xu, H., Chen, K., Xu, C., & Li, Z. (2022). Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1854-1862. https://doi.org/10.1609/aaai.v36i2.20079

Issue

Section

AAAI Technical Track on Computer Vision II