Enhancing Generalizability in Molecular Conformation Generation with METRIZATION-Informed Geometric Diffusion Pretraining

Authors

  • Xiaozhuang Song Chinese University of Hong Kong, Shenzhen
  • Yuzhao Tu The Chinese University of Hong Kong
  • Hangting Ye Jilin University
  • Wei Fan University of Oxford
  • Qingquan Zhang Southern University of Science and Technology
  • Xiaoxue Wang ChemLex Technology Co., Ltd.
  • Tianshu Yu Chinese University of Hong Kong, Shenzhen

DOI:

https://doi.org/10.1609/aaai.v39i1.32058

Abstract

Diffusion-based generative models have recently excelled in generating molecular conformations but struggled with the generalization issue -- models trained on one dataset may produce meaningless conformations on out-of-distribution molecules. On the other hand, distance geometry serves as a generalizable tool for the traditional computational chemistry methods of molecular conformation, which is predicated on the assumption that it is possible to adequately define the set of all potential conformations of any non-rigid molecular system using purely geometric constraints. In this work, we for the first time explicitly incorporate distance geometry constraints into pretraining phase of diffusion-based molecular generation models to improve the generalizability. Inspired by the classical distance geometry solution designed for solving the molecular distance geometry problem, we propose MiGDiff, a Metrization-Informed Geometric Diffusion framework. MiGDiff injects distance geometry constraints by pretraining the deep geometric diffusion backbone within the Metrization sampling approach, yielding a "Metrization-driven pretraining + Data-driven finetuning" paradigm. Experimental results demonstrate that MiGDiff outperforms state-of-the-art methods and possesses strong generalization capabilities, particularly on generating previously unseen molecules, revealing the vast untapped potential of combining traditional computational methods with deep generative models for 3D molecular generation.

Published

2025-04-11

How to Cite

Song, X., Tu, Y., Ye, H., Fan, W., Zhang, Q., Wang, X., & Yu, T. (2025). Enhancing Generalizability in Molecular Conformation Generation with METRIZATION-Informed Geometric Diffusion Pretraining. Proceedings of the AAAI Conference on Artificial Intelligence, 39(1), 755–763. https://doi.org/10.1609/aaai.v39i1.32058

Issue

Section

AAAI Technical Track on Application Domains