Incremental-DETR: Incremental Few-Shot Object Detection via Self-Supervised Learning

Authors

  • Na Dong National University of Singapore Harbin Institute of Technology
  • Yongqiang Zhang Harbin institute of Technology
  • Mingli Ding Harbin institute of Technology
  • Gim Hee Lee National University of Singapore

DOI:

https://doi.org/10.1609/aaai.v37i1.25129

Keywords:

CV: Object Detection & Categorization, CV: Learning & Optimization for CV, CV: Representation Learning for Vision

Abstract

Incremental few-shot object detection aims at detecting novel classes without forgetting knowledge of the base classes with only a few labeled training data from the novel classes. Most related prior works are on incremental object detection that rely on the availability of abundant training samples per novel class that substantially limits the scalability to real-world setting where novel data can be scarce. In this paper, we propose the Incremental-DETR that does incremental few-shot object detection via fine-tuning and self-supervised learning on the DETR object detector. To alleviate severe over-fitting with few novel class data, we first fine-tune the class-specific components of DETR with self-supervision from additional object proposals generated using Selective Search as pseudo labels. We further introduce an incremental few-shot fine-tuning strategy with knowledge distillation on the class-specific components of DETR to encourage the network in detecting novel classes without forgetting the base classes. Extensive experiments conducted on standard incremental object detection and incremental few-shot object detection settings show that our approach significantly outperforms state-of-the-art methods by a large margin. Our source code is available at https://github.com/dongnana777/Incremental-DETR.

Downloads

Published

2023-06-26

How to Cite

Dong, N., Zhang, Y., Ding, M., & Lee, G. H. (2023). Incremental-DETR: Incremental Few-Shot Object Detection via Self-Supervised Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 543-551. https://doi.org/10.1609/aaai.v37i1.25129

Issue

Section

AAAI Technical Track on Computer Vision I