Domain Decorrelation with Potential Energy Ranking

Authors

  • Sen Pei NLPR, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Jiaxi Sun NLPR, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Richard Yi Da Xu Hong Kong Baptist University
  • Shiming Xiang NLPR, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Gaofeng Meng NLPR, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences CAIR, HK Institute of Science and Innovation, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v37i2.25294

Keywords:

CV: Representation Learning for Vision, ML: Classification and Regression, CV: Other Foundations of Computer Vision, ML: Multi-Class/Multi-Label Learning & Extreme Classification, ML: Clustering

Abstract

Machine learning systems, especially the methods based on deep learning, enjoy great success in modern computer vision tasks under ideal experimental settings. Generally, these classic deep learning methods are built on the i.i.d. assumption, supposing the training and test data are drawn from the same distribution independently and identically. However, the aforementioned i.i.d. assumption is, in general, unavailable in the real-world scenarios, and as a result, leads to sharp performance decay of deep learning algorithms. Behind this, domain shift is one of the primary factors to be blamed. In order to tackle this problem, we propose using Potential Energy Ranking (PoER) to decouple the object feature and the domain feature in given images, promoting the learning of label-discriminative representations while filtering out the irrelevant correlations between the objects and the background. PoER employs the ranking loss in shallow layers to make features with identical category and domain labels close to each other and vice versa. This makes the neural networks aware of both objects and background characteristics, which is vital for generating domain-invariant features. Subsequently, with the stacked convolutional blocks, PoER further uses the contrastive loss to make features within the same categories distribute densely no matter domains, filtering out the domain information progressively for feature alignment. PoER reports superior performance on domain generalization benchmarks, improving the average top-1 accuracy by at least 1.20% compared to the existing methods. Moreover, we use PoER in the ECCV 2022 NICO Challenge, achieving top place with only a vanilla ResNet-18 and winning the jury award. The code has been made publicly available at: https://github.com/ForeverPs/PoER.

Downloads

Published

2023-06-26

How to Cite

Pei, S., Sun, J., Xu, R. Y. D., Xiang, S., & Meng, G. (2023). Domain Decorrelation with Potential Energy Ranking. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 2020-2028. https://doi.org/10.1609/aaai.v37i2.25294

Issue

Section

AAAI Technical Track on Computer Vision II