ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models

Authors

  • Jiaxiang Cheng ByteDance Inc.
  • Pan Xie ByteDance Inc.
  • Xin Xia ByteDance Inc.
  • Jiashi Li ByteDance Inc.
  • Jie Wu ByteDance Inc.
  • Yuxi Ren ByteDance Inc.
  • Huixia Li ByteDance Inc.
  • Xuefeng Xiao ByteDance Inc.
  • Shilei Wen ByteDance Inc.
  • Lean Fu ByteDance Inc.

DOI:

https://doi.org/10.1609/aaai.v39i3.32245

Abstract

Recent advancement in text-to-image models and corresponding personalized technologies enables individuals to generate high-quality and imaginative images. However, they often suffer from limitations when generating images with resolutions outside of their trained domain. To overcome this limitation, we present the resolution adapter \textbf{(ResAdapter)}, a domain-consistent adapter designed for diffusion models to generate images with unrestricted resolutions and aspect ratios. Unlike other multi-resolution generation methods that process images of static resolution with complex post-process operations, ResAdapter directly generates images with the dynamical resolution. Especially, after learning a deep understanding of pure resolution priors, ResAdapter trained on the general dataset, generates resolution-free images with personalized diffusion models while preserving their original style domain. Comprehensive experiments demonstrate that ResAdapter with only 0.5M can process images with flexible resolutions for arbitrary diffusion models. More extended experiments demonstrate that ResAdapter is compatible with other modules for image generation across a broad range of resolutions, and can be integrated into other multi-resolution model for efficiently generating higher-resolution images.

Downloads

Published

2025-04-11

How to Cite

Cheng, J., Xie, P., Xia, X., Li, J., Wu, J., Ren, Y., … Fu, L. (2025). ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(3), 2438–2446. https://doi.org/10.1609/aaai.v39i3.32245

Issue

Section

AAAI Technical Track on Computer Vision II