TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models

Yuqi Peng; Lingtao Zheng; Yufeng Yang; Yi Huang; Mingfu Yan; Jianzhuang Liu; Shifeng Chen

doi:10.1609/aaai.v40i10.37788

Authors

Yuqi Peng Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Northeastern University
Lingtao Zheng Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Yufeng Yang Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Yi Huang vivo AI Lab Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Mingfu Yan Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Jianzhuang Liu Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Shifeng Chen Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Shenzhen University of Advanced Technology

DOI:

https://doi.org/10.1609/aaai.v40i10.37788

Abstract

Personalized text-to-image generation aims to synthesize novel images of a specific subject or style using only a few reference images. Recent methods based on Low-Rank Adaptation (LoRA) enable efficient single-concept customization by injecting lightweight, concept-specific adapters into pre-trained diffusion models. However, combining multiple LoRA modules for multi-concept generation often leads to identity missing and visual feature leakage. In this work, we identify two key issues behind these failures: (1) token-wise interference among different LoRA modules, and (2) spatial misalignment between the attention map of a rare token and its corresponding concept-specific region. To address these issues, we propose Token-Aware LoRA (TARA), which introduces a token mask to explicitly constrain each module to focus on its associated rare token to avoid interference, and a training objective that encourages the spatial attention of a rare token to align with its concept region. Our method enables training-free multi-concept composition by directly injecting multiple independently trained TARA modules at inference time. Experimental results demonstrate that TARA enables efficient multi-concept inference and effectively preserving the visual identity of each concept by avoiding mutual interference between LoRA modules.

TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information