Retinomorphic Object Detection in Asynchronous Visual Streams

Authors

  • Jianing Li Peking University
  • Xiao Wang Peng Cheng Laboratory
  • Lin Zhu Peking University
  • Jia Li Beihang University Peng Cheng Laboratory
  • Tiejun Huang Peking University
  • Yonghong Tian Peking University Peng Cheng Laboratory

DOI:

https://doi.org/10.1609/aaai.v36i2.20021

Keywords:

Computer Vision (CV)

Abstract

Due to high-speed motion blur and challenging illumination, conventional frame-based cameras have encountered an important challenge in object detection tasks. Neuromorphic cameras that output asynchronous visual streams instead of intensity frames, by taking the advantage of high temporal resolution and high dynamic range, have brought a new perspective to address the challenge. In this paper, we propose a novel problem setting, retinomorphic object detection, which is the first trial that integrates foveal-like and peripheral-like visual streams. Technically, we first build a large-scale multimodal neuromorphic object detection dataset (i.e., PKU-Vidar-DVS) over 215.5k spatio-temporal synchronized labels. Then, we design temporal aggregation representations to preserve the spatio-temporal information from asynchronous visual streams. Finally, we present a novel bio-inspired unifying framework to fuse two sensing modalities via a dynamic interaction mechanism. Our experimental evaluation shows that our approach has significant improvements over the state-of-the-art methods with the single-modality, especially in high-speed motion and low-light scenarios. We hope that our work will attract further research into this newly identified, yet crucial research direction. Our dataset can be available at https://www.pkuml.org/resources/pku-vidar-dvs.html.

Downloads

Published

2022-06-28

How to Cite

Li, J., Wang, X., Zhu, L., Li, J., Huang, T., & Tian, Y. (2022). Retinomorphic Object Detection in Asynchronous Visual Streams. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1332-1340. https://doi.org/10.1609/aaai.v36i2.20021

Issue

Section

AAAI Technical Track on Computer Vision II