Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot Image Classification

Authors

  • Jijie Wu Lanzhou University of Technology
  • Dongliang Chang Beijing University of Posts and Telecommunications
  • Aneeshan Sain University of Surrey
  • Xiaoxu Li Lanzhou University of Technology
  • Zhanyu Ma Beijing University of Posts and Telecommunications
  • Jie Cao Lanzhou University of Technology
  • Jun Guo Beijing University of Posts and Telecommunications
  • Yi-Zhe Song University of Surrey

DOI:

https://doi.org/10.1609/aaai.v37i3.25383

Keywords:

CV: Object Detection & Categorization, CV: Learning & Optimization for CV, CV: Other Foundations of Computer Vision, CV: Representation Learning for Vision, ML: Classification and Regression, ML: Deep Learning Theory, ML: Deep Neural Network Algorithms, ML: Learning Theory, ML: Meta Learning, ML: Optimization, ML: Representation Learning

Abstract

The main challenge for fine-grained few-shot image classification is to learn feature representations with higher inter-class and lower intra-class variations, with a mere few labelled samples. Conventional few-shot learning methods however cannot be naively adopted for this fine-grained setting -- a quick pilot study reveals that they in fact push for the opposite (i.e., lower inter-class variations and higher intra-class variations). To alleviate this problem, prior works predominately use a support set to reconstruct the query image and then utilize metric learning to determine its category. Upon careful inspection, we further reveal that such unidirectional reconstruction methods only help to increase inter-class variations and are not effective in tackling intra-class variations. In this paper, we for the first time introduce a bi-reconstruction mechanism that can simultaneously accommodate for inter-class and intra-class variations. In addition to using the support set to reconstruct the query set for increasing inter-class variations, we further use the query set to reconstruct the support set for reducing intra-class variations. This design effectively helps the model to explore more subtle and discriminative features which is key for the fine-grained problem in hand. Furthermore, we also construct a self-reconstruction module to work alongside the bi-directional module to make the features even more discriminative. Experimental results on three widely used fine-grained image classification datasets consistently show considerable improvements compared with other methods. Codes are available at: https://github.com/PRIS-CV/Bi-FRN.

Downloads

Published

2023-06-26

How to Cite

Wu, J., Chang, D., Sain, A., Li, X., Ma, Z., Cao, J., Guo, J., & Song, Y.-Z. (2023). Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot Image Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 2821-2829. https://doi.org/10.1609/aaai.v37i3.25383

Issue

Section

AAAI Technical Track on Computer Vision III