FreqTS: Frequency-Aware Token Selection for Accelerating Diffusion Models

Xinye Yang; Yuxin Yang; Haoran Pang; Aaron Xuxiang Tian; Luking Li

doi:10.1609/aaai.v39i9.33008

Authors

Xinye Yang Newcastle University
Yuxin Yang University of Hong Kong
Haoran Pang National University of Singapore
Aaron Xuxiang Tian Independent researcher
Luking Li Independent researcher

DOI:

https://doi.org/10.1609/aaai.v39i9.33008

Abstract

In this paper, we propose FreqTS, a novel Frequency-Aware Token Selection approach for accelerating diffusion models without requiring retraining. Diffusion models have gained significant attention in the field of image synthesis due to their impressive generative capabilities. However, these models often suffer from high computational costs, primarily due to the sequential denoising process and large model size. Additionally, diffusion models tend to prioritize low-frequency features, leading to sub-optimal quantitative results. To address these challenges, FreqTS introduces an amplitude-based sorting method that separates Token features in the frequency domain of diffusion models into high-frequency and low-frequency subsets. It then utilizes fast Token Selection to reduce the presence of low-frequency features, effectively reducing the computational overhead. Moreover, FreqTS incorporates a Bayesian hyper-parameter search to dynamically assign different selection strategies for various denoising processes. Extensive experiments conducted on Stable Diffusion series models, PixArt-Alpha, LCM, and other models demonstrate that FreqTS achieves a minimum acceleration of 2.3× without the need for retraining. Furthermore, FreqTS showcases its versatility by being applicable to different sampling techniques and compatible with other dimension-specific acceleration algorithms.

FreqTS: Frequency-Aware Token Selection for Accelerating Diffusion Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information