Learning Temporal-Ordered Representation for Spike Streams Based on Discrete Wavelet Transforms

Authors

  • Jiyuan Zhang Peking University
  • Shanshan Jia Peking University
  • Zhaofei Yu Peking University
  • Tiejun Huang Peking University

DOI:

https://doi.org/10.1609/aaai.v37i1.25085

Keywords:

CMS: Brain Modeling, CV: Computational Photography, Image & Video Synthesis

Abstract

Spike camera, a new type of neuromorphic visual sensor that imitates the sampling mechanism of the primate fovea, can capture photons and output 40000 Hz binary spike streams. Benefiting from the asynchronous sampling mechanism, the spike camera can record fast-moving objects and clear images can be recovered from the spike stream at any specified timestamps without motion blurring. Despite these, due to the dense time sequence information of the discrete spike stream, it is not easy to directly apply the existing algorithms of traditional cameras to the spike camera. Therefore, it is necessary and interesting to explore a universally effective representation of dense spike streams to better fit various network architectures. In this paper, we propose to mine temporal-robust features of spikes in time-frequency space with wavelet transforms. We present a novel Wavelet-Guided Spike Enhancing (WGSE) paradigm consisting of three consecutive steps: multi-level wavelet transform, CNN-based learnable module, and inverse wavelet transform. With the assistance of WGSE, the new streaming representation of spikes can be learned. We demonstrate the effectiveness of WGSE on two downstream tasks, achieving state-of-the-art performance on the image reconstruction task and getting considerable performance on semantic segmentation. Furthermore, We build a new spike-based synthesized dataset for semantic segmentation. Code and Datasets are available at https://github.com/Leozhangjiyuan/WGSE-SpikeCamera.

Downloads

Published

2023-06-26

How to Cite

Zhang, J., Jia, S., Yu, Z., & Huang, T. (2023). Learning Temporal-Ordered Representation for Spike Streams Based on Discrete Wavelet Transforms. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 137-147. https://doi.org/10.1609/aaai.v37i1.25085

Issue

Section

AAAI Technical Track on Cognitive Modeling & Cognitive Systems