Target-Balanced Score Distillation

Authors

  • Zhou Xu Tsinghua University
  • Qi Wang Xidian University
  • Yuxiao Yang Tsinghua University
  • Luyuan Zhang Tsinghua University
  • Zhang Liang Xidian University
  • Yang Li Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v40i14.38131

Abstract

Score Distillation Sampling (SDS) enables 3D asset generation by distilling priors from pretrained 2D text-to-image diffusion models, but vanilla SDS suffers from over-saturation and over-smoothing. To mitigate this issue, recent variants have incorporated negative prompts. However, these methods face a critical trade-off: limited texture optimization, or significant texture gains with shape distortion. In this work, we first conduct a systematic analysis and reveal that this trade-off is fundamentally governed by the utilization of the negative prompts, where Target Negative Prompts (TNP) that embed target information in the negative prompts dramatically enhancing texture realism and fidelity but inducing shape distortions. Informed by this key insight, we introduce the Target-Balanced Score Distillation (TBSD). It formulates generation as a multi-objective optimization problem and introduces an adaptive strategy that effectively resolves the aforementioned trade-off. Extensive experiments demonstrate that TBSD significantly outperforms existing state-of-the-art methods, yielding 3D assets with high-fidelity textures and geometrically accurate shape.

Downloads

Published

2026-03-14

How to Cite

Xu, Z., Wang, Q., Yang, Y., Zhang, L., Liang, Z., & Li, Y. (2026). Target-Balanced Score Distillation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 11487–11495. https://doi.org/10.1609/aaai.v40i14.38131

Issue

Section

AAAI Technical Track on Computer Vision XI