CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks

Authors

  • Wenfeng Song College of Computer Science, Beijing Information Science and Technology University
  • Yang Ding College of Computer Science, Beijing Information Science and Technology University
  • Fei Hou Key Laboratory of System Software (CAS), State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China University of Chinese Academy of Sciences, China
  • Shuai Li State Key Laboratory of Virtual Reality Technology and Systems, Beihang University Zhongguancun Laboratory, China
  • Aimin Hao State Key Laboratory of Virtual Reality Technology and Systems, Beihang University
  • Xia Hou College of Computer Science, Beijing Information Science and Technology University

DOI:

https://doi.org/10.1609/aaai.v39i7.32747

Abstract

As virtual experiences grow in popularity, the demand for realistic, personalized, and animatable human avatars increases. Traditional methods, relying on fixed templates, often produce costly avatars that lack expressiveness and realism. To overcome these challenges, we introduce Controllable Avatars generation via disentangled invertible networks (CtrlAvatar), a real-time framework for generating lifelike and customizable avatars. CtrlAvatar uses disentangled invertible networks to separate the deformation process into implicit body geometry and explicit texture components. This approach eliminates the need for repeated occupancy reconstruction, enabling detailed and coherent animations. The body geometry component ensures anatomical accuracy, while the texture component allows for complex, artifact-free clothing customization. This architecture ensures smooth integration between body movements and surface details. By optimizing transformations with position-varying offsets from the avatar’s initial Linear Blend Skinning vertices, CtrlAvatar achieves flexible, natural deformations that adapt to various scenarios. Extensive experiments show that CtrlAvatar outperforms other methods in quality, diversity, controllability, and cost-efficiency, marking a significant advancement in avatar generation.

Downloads

Published

2025-04-11

How to Cite

Song, W., Ding, Y., Hou, F., Li, S., Hao, A., & Hou, X. (2025). CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 6959–6967. https://doi.org/10.1609/aaai.v39i7.32747

Issue

Section

AAAI Technical Track on Computer Vision VI