RAGAR: Retrieval Augmented Personalized Image Generation Guided by Recommendation

Authors

  • Run Ling Northeastern University
  • Wenji Wang Northeastern University
  • Yuting Liu Northeastern University
  • Guibing Guo Northeastern University
  • Haowei Liu Chongqing University of Post and Telecommunications
  • Jian Lu Chongqing University of Post and Telecommunications
  • Quanwei Zhang Zhejiang University
  • Yexing Xu Sun Yat-sen University
  • Shuo Lu University of Chinese Academy of Sciences
  • Yun Wang Sun Yat-sen University
  • Yihua Shao University of Chinese Academy of Sciences
  • Linying Jiang Northeastern University
  • Xingwei Wang Northeastern University

DOI:

https://doi.org/10.1609/aaai.v40i18.38553

Abstract

Personalized image generation is crucial for improving the user experience, as it renders reference images into preferred ones according to user visual preferences. Although effective, existing methods face two main issues. First, existing methods treat all items in the user's historical sequence equally when extracting user preferences, overlooking the varying semantic similarities between historical items and the reference item. Disproportionately high weights for low-similarity items distort user visual preferences for the reference item. Second, existing methods heavily rely on consistency between generated and reference images to optimize generation, which leads to underfitting user preferences and hinders personalization. To address these issues, we propose Retrieval Augmented Personalized Image GenerAtion guided by Recommendation (RAGAR). Our approach uses a retrieval mechanism to assign different weights to historical items according to their similarities to the reference item, thereby extracting more refined users' visual preferences for the reference item. Then we introduce a novel rank task based on the multi-modal ranking model to optimize the personalization of the generated images instead of forcing depend on consistency. Extensive experiments and human evaluations on three real-world datasets demonstrate that RAGAR achieves significant improvements in both personalization and semantic metrics compared to five baselines.

Published

2026-03-14

How to Cite

Ling, R., Wang, W., Liu, Y., Guo, G., Liu, H., Lu, J., … Wang, X. (2026). RAGAR: Retrieval Augmented Personalized Image Generation Guided by Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15278–15286. https://doi.org/10.1609/aaai.v40i18.38553

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II