SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection

Authors

  • Jiahao Wang School of Computer Science and Technology, MOEKLINNS Laboratory, Xi'an Jiaotong University
  • Caixia Yan School of Computer Science and Technology, MOEKLINNS Laboratory, Xi'an Jiaotong University
  • Weizhan Zhang School of Computer Science and Technology, MOEKLINNS Laboratory, Xi'an Jiaotong University
  • Huan Liu School of Computer Science and Technology, MOEKLINNS Laboratory, Xi'an Jiaotong University
  • Hao Sun China Telecom Artificial Intelligence Technology Co.Ltd
  • Qinghua Zheng School of Computer Science and Technology, MOEKLINNS Laboratory, Xi'an Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v38i6.28353

Keywords:

CV: Object Detection & Categorization

Abstract

Zero-shot object detection (ZSD) aims to localize and classify unseen objects without access to their training annotations. As a prevailing solution to ZSD, generation-based methods synthesize unseen visual features by taking seen features as reference and class semantic embeddings as guideline. Although previous works continuously improve the synthesis quality, they fail to consider the scale-varying nature of unseen objects. The generation process is preformed over a single scale of object features and thus lacks scale-diversity among synthesized features. In this paper, we reveal the scale-varying challenge in ZSD and propose a Scale-Aware Unseen Imagineer (SAUI) to lead the way of a novel scale-aware ZSD paradigm. To obtain multi-scale features of seen-class objects, we design a specialized coarse-to-fine extractor to capture features through multiple scale-views. To generate unseen features scale by scale, we innovate a Series-GAN synthesizer along with three scale-aware contrastive components to imagine separable, diverse and robust scale-wise unseen features. Extensive experiments on PASCAL VOC, COCO and DIOR datasets demonstrate SAUI's better performance in different scenarios, especially for scale-varying and small objects. Notably, SAUI achieves the new state-of-the art performance on COCO and DIOR.

Downloads

Published

2024-03-24

How to Cite

Wang, J., Yan, C., Zhang, W., Liu, H., Sun, H., & Zheng, Q. (2024). SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5445-5453. https://doi.org/10.1609/aaai.v38i6.28353

Issue

Section

AAAI Technical Track on Computer Vision V