Training-Free ANN-to-SNN Conversion for High-Performance Spiking Transformers
DOI:
https://doi.org/10.1609/aaai.v40i3.37195
Abstract
Leveraging the event-driven paradigm, Spiking Neural Networks (SNNs) offer a promising approach for constructing energy-efficient Transformer architectures. Compared to directly trained Spiking Transformers, ANN-to-SNN conversion methods bypass the high training costs. However, existing methods still suffer from notable limitations: they fail to handle the nonlinear operations in Transformer architectures effectively and require additional fine-tuning of pre-trained ANNs. To address these issues, we propose a high-performance, training-free ANN-to-SNN conversion framework tailored to Transformer architectures. Specifically, we introduce a Multi-basis Exponential Decay (MBE) neuron, which combines an exponential decay strategy with a multi-basis encoding method to efficiently approximate various nonlinear operations, removing any need to modify the weights of pre-trained ANNs. Extensive experiments across diverse tasks (CV, NLU, NLG) and mainstream Transformer architectures (ViT, RoBERTa, GPT-2) demonstrate that our method achieves near-lossless conversion accuracy with significantly lower latency, providing a promising pathway for the efficient and scalable deployment of Spiking Transformers in real-world applications.
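The abstract does not give the MBE neuron's exact formulation, so the Python sketch below is only a loose illustration of the two ideas it names: spike contributions that decay exponentially across time steps, and several amplitude "bases" per step that refine the approximation of a nonlinear target such as GELU. All function names, parameters, and the greedy firing rule here are assumptions made for illustration, not the paper's actual method.

import numpy as np

def mbe_encode(x, num_steps=8, bases=(1.0, 0.5), decay=0.5):
    """Hypothetical multi-basis exponential-decay spike encoding.

    Greedily approximates a non-negative scalar x with binary spikes
    whose amplitudes shrink geometrically over time. Each time step
    offers several per-step amplitudes ("bases"); a spike is emitted
    for a basis when its full amplitude still fits in the residual.
    """
    residual = float(x)
    spikes = np.zeros((num_steps, len(bases)), dtype=np.int8)
    approx = 0.0
    for t in range(num_steps):
        scale = decay ** t                  # exponential decay over time steps
        for b, base in enumerate(bases):    # multiple bases per time step
            amp = base * scale
            if residual >= amp:             # fire only if the amplitude fits
                spikes[t, b] = 1
                residual -= amp
                approx += amp
    return spikes, approx

def mbe_apply(fn, x, **kw):
    """Approximate a nonlinear op fn at input x via spike encoding."""
    target = fn(x)
    spikes, approx = mbe_encode(target, **kw)
    return spikes, approx, target

# Example: approximating GELU at a sample input.
gelu = lambda v: 0.5 * v * (1 + np.tanh(np.sqrt(2 / np.pi) * (v + 0.044715 * v**3)))
spikes, approx, exact = mbe_apply(gelu, 1.3)
print(f"exact={exact:.4f}  spike-approx={approx:.4f}")

Under these toy assumptions, a handful of time steps already yields a close fit, which is consistent with the abstract's claim that multi-basis encoding supports accurate approximation at low latency; the real MBE neuron presumably operates on spike-driven inputs rather than on a precomputed target value.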
Published
2026-03-14
How to Cite
Wang, J., Deng, X., Wei, W., Zhang, D., Wang, S., Sun, Q., Zhang, J., Liu, H., Xie, N., & Zhang, M. (2026). Training-Free ANN-to-SNN Conversion for High-Performance Spiking Transformers. Proceedings of the AAAI Conference on Artificial Intelligence, 40(3), 2128-2136. https://doi.org/10.1609/aaai.v40i3.37195
Section
AAAI Technical Track on Cognitive Modeling & Cognitive Systems