PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model

Authors

  • Hongsheng Zheng School of Computer Science, Wuhan University, Wuhan, China
  • Zhongyun Bao School of Computer and Information, Anhui Polytechnic University, Wuhu, China
  • Gang Fu Department of Computing, The Hong Kong Polytechnic University, Hong Kong SAR, China
  • Xuze Jiao School of Computer Science, Wuhan University, Wuhan, China
  • Chunxia Xiao School of Computer Science, Wuhan University, Wuhan, China

DOI:

https://doi.org/10.1609/aaai.v39i10.33146

Abstract

Portraits often suffer from specular highlights due to factors like skin oiliness, lighting conditions, and shooting angles, which degrade aesthetics and affect downstream tasks. Thus, portrait highlight removal is imperative. Previous methods struggle to remove highlights and achieve high-fidelity restoration of disturbed regions simultaneously. In this work, we propose a novel patch-based diffusion model for this task, named PHR-DIFF. Specifically, in the training, we present a patchify training strategy that divides the portrait into equal-sized patches and performs diffusion on these patches individually. This patchify can extract more compact facial features and reduce training costs. Besides, to learn the global coherence of the face, we propose a patch-residual approach. It encodes the full-resolution highlight-free portrait into latent features, which are further used as residual terms to constrain the forward training. In the sampling, we remove portrait highlights in a patch-wise manner and propose a Patch-Aware Highlight Removal (PAHR) mechanism. PAHR leverages features from non-highlight regions to effectively guide the patch-wise removal of highlight components. Experimental results on multiple public datasets demonstrate that PHR-DIFF removes highlights more cleanly and avoids artifacts.

Downloads

Published

2025-04-11

How to Cite

Zheng, H., Bao, Z., Fu, G., Jiao, X., & Xiao, C. (2025). PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model. Proceedings of the AAAI Conference on Artificial Intelligence, 39(10), 10555–10563. https://doi.org/10.1609/aaai.v39i10.33146

Issue

Section

AAAI Technical Track on Computer Vision IX