CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis

Authors

  • Alec Sargood University College London
  • Lemuel Puglisi University of Catania
  • James H Cole University College London
  • Neil P. Oxtoby University College London
  • Daniele Ravì University of Messina
  • Daniel C. Alexander University College London

DOI:

https://doi.org/10.1609/aaai.v40i11.37831

Abstract

Synthesizing amyloid PET scans from the more widely available and accessible structural MRI modality offers a promising, cost-effective approach for large-scale Alzheimer's Disease (AD) screening. This is motivated by evidence that, while MRI does not directly detect amyloid pathology, it may nonetheless encode information correlated with amyloid deposition that can be uncovered through advanced modeling. However, the high dimensionality and structural complexity of 3D neuroimaging data pose significant challenges for existing MRI-to-PET translation methods. Modeling the cross-modality relationship in a lower-dimensional latent space can simplify the learning task and enable more effective translation. As such, we present CoCoLIT (ControlNet-Conditioned Latent Image Translation), a diffusion-based latent generative framework that incorporates three main innovations: (1) a novel Weighted Image Space Loss (WISL) that improves latent representation learning and synthesis quality; (2) a theoretical and empirical analysis of Latent Average Stabilization (LAS), an existing technique used in similar generative models to enhance inference consistency; and (3) the introduction of ControlNet-based conditioning for MRI-to-PET translation. We evaluate CoCoLIT's performance on publicly available datasets and find that our model significantly outperforms state-of-the-art methods on both image-based and amyloid-related metrics. Notably, in amyloid-positivity classification, CoCoLIT outperforms the second-best method with improvements of +10.5% on the internal dataset and +23.7% on the external dataset.

Downloads

Published

2026-03-14

How to Cite

Sargood, A., Puglisi, L., Cole, J. H., Oxtoby, N. P., Ravì, D., & Alexander, D. C. (2026). CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 8778–8786. https://doi.org/10.1609/aaai.v40i11.37831

Issue

Section

AAAI Technical Track on Computer Vision VIII