QueryProp: Object Query Propagation for High-Performance Video Object Detection

Authors

  • Fei He CRISE, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Naiyu Gao CRISE, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Jian Jia CRISE, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Xin Zhao CRISE, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Kaiqi Huang CRISE, Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences CAS Center for Excellence in Brain Science and Intelligence Technology

DOI:

https://doi.org/10.1609/aaai.v36i1.19965

Keywords:

Computer Vision (CV)

Abstract

Video object detection has been an important yet challenging topic in computer vision. Traditional methods mainly focus on designing the image-level or box-level feature propagation strategies to exploit temporal information. This paper argues that with a more effective and efficient feature propagation framework, video object detectors can gain improvement in terms of both accuracy and speed. For this purpose, this paper studies object-level feature propagation, and proposes an object query propagation (QueryProp) framework for high-performance video object detection. The proposed QueryProp contains two propagation strategies: 1) query propagation is performed from sparse key frames to dense non-key frames to reduce the redundant computation on non-key frames; 2) query propagation is performed from previous key frames to the current key frame to improve feature representation by temporal context modeling. To further facilitate query propagation, an adaptive propagation gate is designed to achieve flexible key frame selection. We conduct extensive experiments on the ImageNet VID dataset. QueryProp achieves comparable accuracy with state-of-the-art methods and strikes a decent accuracy/speed trade-off.

Downloads

Published

2022-06-28

How to Cite

He, F., Gao, N., Jia, J., Zhao, X., & Huang, K. (2022). QueryProp: Object Query Propagation for High-Performance Video Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 834-842. https://doi.org/10.1609/aaai.v36i1.19965

Issue

Section

AAAI Technical Track on Computer Vision I