DMN4: Few-Shot Learning via Discriminative Mutual Nearest Neighbor Neural Network

Authors

  • Yang Liu State Key Lab of CAD&CG, College of Computer Science, Zhejiang University
  • Tu Zheng State Key Lab of CAD&CG, College of Computer Science, Zhejiang University FABU Inc., Hangzhou, China
  • Jie Song Zhejiang University
  • Deng Cai State Key Lab of CAD&CG, College of Computer Science, Zhejiang University FABU Inc., Hangzhou, China
  • Xiaofei He State Key Lab of CAD&CG, College of Computer Science, Zhejiang University FABU Inc., Hangzhou, China

DOI:

https://doi.org/10.1609/aaai.v36i2.20076

Keywords:

Computer Vision (CV)

Abstract

Few-shot learning (FSL) aims to classify images under low-data regimes, where the conventional pooled global feature is likely to lose useful local characteristics. Recent work has achieved promising performances by using deep descriptors. They generally take all deep descriptors from neural networks into consideration while ignoring that some of them are useless in classification due to their limited receptive field, e.g., task-irrelevant descriptors could be misleading and multiple aggregative descriptors from background clutter could even overwhelm the object's presence. In this paper, we argue that a Mutual Nearest Neighbor (MNN) relation should be established to explicitly select the query descriptors that are most relevant to each task and discard less relevant ones from aggregative clutters in FSL. Specifically, we propose Discriminative Mutual Nearest Neighbor Neural Network (DMN4) for FSL. Extensive experiments demonstrate that our method outperforms the existing state-of-the-arts on both fine-grained and generalized datasets.

Downloads

Published

2022-06-28

How to Cite

Liu, Y., Zheng, T., Song, J., Cai, D., & He, X. (2022). DMN4: Few-Shot Learning via Discriminative Mutual Nearest Neighbor Neural Network. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1828-1836. https://doi.org/10.1609/aaai.v36i2.20076

Issue

Section

AAAI Technical Track on Computer Vision II