FreqTS: Frequency-Aware Token Selection for Accelerating Diffusion Models

Authors

  • Xinye Yang Newcastle University
  • Yuxin Yang University of Hong Kong
  • Haoran Pang National University of Singapore
  • Aaron Xuxiang Tian Independent researcher
  • Luking Li Independent researcher

DOI:

https://doi.org/10.1609/aaai.v39i9.33008

Abstract

In this paper, we propose FreqTS, a novel Frequency-Aware Token Selection approach for accelerating diffusion models without requiring retraining. Diffusion models have gained significant attention in the field of image synthesis due to their impressive generative capabilities. However, these models often suffer from high computational costs, primarily due to the sequential denoising process and large model size. Additionally, diffusion models tend to prioritize low-frequency features, leading to sub-optimal quantitative results. To address these challenges, FreqTS introduces an amplitude-based sorting method that separates Token features in the frequency domain of diffusion models into high-frequency and low-frequency subsets. It then utilizes fast Token Selection to reduce the presence of low-frequency features, effectively reducing the computational overhead. Moreover, FreqTS incorporates a Bayesian hyper-parameter search to dynamically assign different selection strategies for various denoising processes. Extensive experiments conducted on Stable Diffusion series models, PixArt-Alpha, LCM, and other models demonstrate that FreqTS achieves a minimum acceleration of 2.3× without the need for retraining. Furthermore, FreqTS showcases its versatility by being applicable to different sampling techniques and compatible with other dimension-specific acceleration algorithms.

Downloads

Published

2025-04-11

How to Cite

Yang, X., Yang, Y., Pang, H., Tian, A. X., & Li, L. (2025). FreqTS: Frequency-Aware Token Selection for Accelerating Diffusion Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(9), 9310-9317. https://doi.org/10.1609/aaai.v39i9.33008

Issue

Section

AAAI Technical Track on Computer Vision VIII