Optical Flow for Spike Camera with Hierarchical Spatial-Temporal Spike Fusion

Authors

  • Rui Zhao Peking University
  • Ruiqin Xiong Peking University
  • Jian Zhang Peking University
  • Xinfeng Zhang University of Chinese Academy of Sciences
  • Zhaofei Yu Peking University
  • Tiejun Huang Peking University

DOI:

https://doi.org/10.1609/aaai.v38i7.28581

Keywords:

CV: Computational Photography, Image & Video Synthesis, CV: Low Level & Physics-based Vision, CV: Motion & Tracking

Abstract

As an emerging neuromorphic camera with an asynchronous working mechanism, spike camera shows good potential for high-speed vision tasks. Each pixel in spike camera accumulates photons persistently and fires a spike whenever the accumulation exceeds a threshold. Such high-frequency fine-granularity photon recording facilitates the analysis and recovery of dynamic scenes with high-speed motion. This paper considers the optical flow estimation problem for spike cameras. Due to the Poisson nature of incoming photons, the occurrence of spikes is random and fluctuating, making conventional image matching inefficient. We propose a Hierarchical Spatial-Temporal (HiST) fusion module for spike representation to pursue reliable feature matching and develop a robust optical flow network, dubbed as HiST-SFlow. The HiST extracts features at multiple moments and hierarchically fuses the spatial-temporal information. We also propose an intra-moment filtering module to further extract the feature and suppress the influence of randomness in spikes. A scene loss is proposed to ensure that this hierarchical representation recovers the essential visual information in the scene. Experimental results demonstrate that the proposed method achieves state-of-the-art performance compared with the existing methods. The source codes are available at https://github.com/ruizhao26/HiST-SFlow.

Published

2024-03-24

How to Cite

Zhao, R., Xiong, R., Zhang, J., Zhang, X., Yu, Z., & Huang, T. (2024). Optical Flow for Spike Camera with Hierarchical Spatial-Temporal Spike Fusion. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7496-7504. https://doi.org/10.1609/aaai.v38i7.28581

Issue

Section

AAAI Technical Track on Computer Vision VI