CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks
DOI:
https://doi.org/10.1609/aaai.v39i7.32747Abstract
As virtual experiences grow in popularity, the demand for realistic, personalized, and animatable human avatars increases. Traditional methods, relying on fixed templates, often produce costly avatars that lack expressiveness and realism. To overcome these challenges, we introduce Controllable Avatars generation via disentangled invertible networks (CtrlAvatar), a real-time framework for generating lifelike and customizable avatars. CtrlAvatar uses disentangled invertible networks to separate the deformation process into implicit body geometry and explicit texture components. This approach eliminates the need for repeated occupancy reconstruction, enabling detailed and coherent animations. The body geometry component ensures anatomical accuracy, while the texture component allows for complex, artifact-free clothing customization. This architecture ensures smooth integration between body movements and surface details. By optimizing transformations with position-varying offsets from the avatar’s initial Linear Blend Skinning vertices, CtrlAvatar achieves flexible, natural deformations that adapt to various scenarios. Extensive experiments show that CtrlAvatar outperforms other methods in quality, diversity, controllability, and cost-efficiency, marking a significant advancement in avatar generation.Downloads
Published
2025-04-11
How to Cite
Song, W., Ding, Y., Hou, F., Li, S., Hao, A., & Hou, X. (2025). CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 6959–6967. https://doi.org/10.1609/aaai.v39i7.32747
Issue
Section
AAAI Technical Track on Computer Vision VI