FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting

Authors

  • Yitong Yang School of Computing and Artificial Intelligence, Shanghai University of Finance and Economics, Shanghai, China
  • Yinglin Wang School of Computing and Artificial Intelligence, Shanghai University of Finance and Economics, Shanghai, China
  • Changshuo Wang Department of Computer Science University College London, London, United Kingdom
  • Huajie Wang Shandong University of Finance and Economics, Shandong, China Jinan Yunwei Software Technology Co., Ltd, Shandong, China
  • Shuting He MoE Key Laboratory of Interdisciplinary Research of Computation and Economics, Shanghai, China School of Computing and Artificial Intelligence, Shanghai University of Finance and Economics, Shanghai, China

DOI:

https://doi.org/10.1609/aaai.v40i14.38164

Abstract

The success of 3DGS in generative and editing applications has sparked growing interest in 3DGS-based style transfer. However, current methods still face two major challenges: (1) multi-view inconsistency often leads to style conflicts, resulting in appearance smoothing and distortion; and (2) heavy reliance on VGG features, which struggle to disentangle style and content from style images, often causing content leakage and excessive stylization. To tackle these issues, we introduce FantasyStyle, a 3DGS-based style transfer framework, and the first to rely entirely on diffusion model distillation. It comprises two key components: (1) Multi-View Frequency Consistency. We enhance cross-view consistency by applying a 3D filter to multi-view noisy latent, selectively reducing low-frequency components to mitigate stylized prior conflicts. (2) Controllable Stylized Distillation. To suppress content leakage from style images, we introduce negative guidance to exclude undesired content. In addition, we identify the limitations of Score Distillation Sampling and Delta Denoising Score in 3D style transfer and remove the reconstruction term accordingly. Building on these insights, we propose a controllable stylized distillation that leverages negative guidance to more effectively optimize the 3D Gaussians. Extensive experiments demonstrate that our method consistently outperforms state-of-the-art approaches, achieving higher stylization quality and visual realism across various scenes and styles.

Downloads

Published

2026-03-14

How to Cite

Yang, Y., Wang, Y., Wang, C., Wang, H., & He, S. (2026). FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 11784–11792. https://doi.org/10.1609/aaai.v40i14.38164

Issue

Section

AAAI Technical Track on Computer Vision XI