TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation
DOI:
https://doi.org/10.1609/aaai.v40i1.37006Abstract
Diffusion Transformers (DiTs) are a powerful yet underexplored class of generative models compared to U-Net-based diffusion architectures. We propose TIDE—Temporal-aware sparse autoencoders for Interpretable Diffusion transformErs—a framework designed to extract sparse, interpretable activation features across timesteps in DiTs. TIDE effectively captures temporally-varying representations and reveals that DiTs naturally learn hierarchical semantics (e.g., 3D structure, object class, and fine-grained concepts) during large-scale pretraining. Experiments show that TIDE enhances interpretability and controllability while maintaining reasonable generation quality, enabling applications such as safe image editing and style transfer.Published
2026-03-14
How to Cite
Huang, V. S.-J., Zhuo, L., Xin, Y., Wang, Z., Wang, F.-Y., Wang, Y., … Li, H. (2026). TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(1), 435–443. https://doi.org/10.1609/aaai.v40i1.37006
Issue
Section
AAAI Technical Track on Application Domains I