IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation

Donghao Zhou; Jingyu Lin; Guibao Shen; Quande Liu; Jialin Gao; Lihao Liu; Lan Du; Cunjian Chen; Chi-Wing Fu; Xiaowei Hu; Pheng-Ann Heng

doi:10.1609/aaai.v40i16.38365

Authors

Donghao Zhou, The Chinese University of Hong Kong
Jingyu Lin, Monash University
Guibao Shen, The Hong Kong University of Science and Technology (Guangzhou)
Quande Liu, Kling Team, Kuaishou Technology
Jialin Gao, The Chinese University of Hong Kong
Lihao Liu, Amazon
Lan Du, Monash University
Cunjian Chen, Monash University
Chi-Wing Fu, The Chinese University of Hong Kong
Xiaowei Hu, South China University of Technology
Pheng-Ann Heng, The Chinese University of Hong Kong

Abstract

Recent visual generative models enable story generation with consistent characters from text, but human-centric story generation faces additional challenges, such as maintaining detailed and diverse human face consistency and coordinating multiple characters across different images. This paper presents IdentityStory, a framework for human-centric story generation that ensures consistent character identity across multiple sequential images. By taming identity-preserving generators, the framework features two key components: Iterative Identity Discovery, which extracts cohesive character identities, and Re-denoising Identity Injection, which re-denoises images to inject identities while preserving desired context. Experiments on the ConsiStory-Human benchmark demonstrate that IdentityStory outperforms existing methods, particularly in face consistency, and supports multi-character combinations. The framework also shows strong potential for applications such as infinite-length story generation and dynamic character composition.
