OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation

Authors

  • Zhanpeng Wang Dalian University of Technology
  • Shuting Cao Dalian University of Technology
  • Yuhang Lu Dalian University of Technology
  • YuhanLi Dalian University of Technology
  • Na Lei Dalian University of Technology
  • Zhongxuan Luo Dalian University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i31.39886

Abstract

The Dual Diffusion Implicit Bridge (DDIB) is an emerging image-to-image (I2I) translation method that preserves cycle consistency while achieving strong flexibility. It links two independently trained diffusion models (DMs) in the source and target domains by first adding noise to a source image to obtain a latent code, then denoising it in the target domain to generate the translated image. However, this method faces two key challenges: (1) low translation efficiency, and (2) translation trajectory deviations caused by mismatched latent distributions. To address these issues, we propose a novel I2I translation framework, OT-ALD, grounded in optimal transport (OT) theory, which retains the strengths of DDIB-based approach. Specifically, we compute an OT map from the latent distribution of the source domain to that of the target domain, and use the mapped distribution as the starting point for the reverse diffusion process in the target domain. Our error analysis confirms that OT-ALD eliminates latent distribution mismatches. Moreover, OT-ALD effectively balances faster image translation with improved image quality. Experiments on four translation tasks across three high-resolution datasets show that OT-ALD improves sampling efficiency by 20.29% and reduces the FID score by 2.6 on average compared to the top-performing baseline models.

Downloads

Published

2026-03-14

How to Cite

Wang, Z., Cao, S., Lu, Y., , Y., Lei, N., & Luo, Z. (2026). OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(31), 26760–26768. https://doi.org/10.1609/aaai.v40i31.39886

Issue

Section

AAAI Technical Track on Machine Learning VIII