Accelerating Controllable Generation via Hybrid-grained Cache

Authors

  • Lin Liu University of Science and Technology of China
  • Huixia Ben Anhui University of Science and Technology
  • Shuo Wang University of Science and Technology of China
  • Jinda Lu University of Science and Technology of China
  • Junxiang Qiu University of Science and Technology of China
  • Shengeng Tang Hefei University of Technology
  • Yanbin Hao Hefei University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i9.37658

Abstract

Controllable generative models have been widely used to improve the realism of synthetic visual content. However, such models must handle control conditions and content generation computational requirements, resulting in generally low generation efficiency. To address this issue, we propose a Hybrid-Grained Cache (HGC) approach that reduces computational overhead by adopting cache strategies with different granularities at different computational stages. Specifically, (1) we use a coarse-grained cache (block-level) based on feature reuse to dynamically bypass redundant computations in encoder-decoder blocks between each step of model reasoning. (2) We design a fine-grained cache (prompt-level) that acts within a module, where the fine-grained cache reuses cross-attention maps within consecutive reasoning steps and extends them to the corresponding module computations of adjacent steps. These caches of different granularities can be seamlessly integrated into each computational link of the controllable generation process. We verify the effectiveness of HGC on four benchmark datasets, especially its advantages in balancing generation efficiency and visual quality. For example, on the COCO-Stuff segmentation benchmark, our HGC significantly reduces the computational cost (MACs) by 63% (from 18.22T → 6.70T↓), while keeping the loss of semantic fidelity (quantized performance degradation) within 1.5%.

Downloads

Published

2026-03-14

How to Cite

Liu, L., Ben, H., Wang, S., Lu, J., Qiu, J., Tang, S., & Hao, Y. (2026). Accelerating Controllable Generation via Hybrid-grained Cache. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 7215–7223. https://doi.org/10.1609/aaai.v40i9.37658

Issue

Section

AAAI Technical Track on Computer Vision VI