EPO: Diverse and Realistic Protein Ensemble Generation via Energy Preference Optimization

Authors

  • Yuancheng Sun Institute of Automation, Chinese Academy of Sciences University of Chinese Academy of Sciences Beijing Academy of Artificial Intelligence
  • Yuxuan Ren Beijing Academy of Artificial Intelligence
  • Zhaoming Chen Beijing Academy of Artificial Intelligence
  • Xu Han Institute of Automation, Chinese Academy of Sciences University of Chinese Academy of Sciences Beijing Academy of Artificial Intelligence
  • Kang Liu Institute of Automation, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Qiwei Ye Beijing Academy of Artificial Intelligence

DOI:

https://doi.org/10.1609/aaai.v40i2.37076

Abstract

Accurate exploration of protein conformational ensembles is essential for uncovering function but remains hard because molecular-dynamics (MD) simulations suffer from high computational costs and energy-barrier trapping. This paper presents Energy Preference Optimization (EPO), an online refinement algorithm that turns a pretrained protein ensemble generator into an energy-aware sampler without extra MD trajectories. Specifically, EPO leverages stochastic differential equation sampling to explore the conformational landscape and incorporates a novel energy-ranking mechanism based on list-wise preference optimization. Crucially, EPO introduces a practical upper bound to efficiently approximate the intractable probability of long sampling trajectories in continuous-time generative models, making it easily adaptable to existing pretrained generators. On Tetrapeptides, ATLAS, and Fast-Folding benchmarks, EPO successfully generates diverse and physically realistic ensembles, establishing a new state-of-the-art in nine evaluation metrics. These results demonstrate that energy-only preference signals can efficiently steer generative models toward thermodynamically consistent conformational ensembles, providing an alternative to long MD simulations and widening the applicability of learned potentials in structural biology and drug discovery.

Downloads

Published

2026-03-14

How to Cite

Sun, Y., Ren, Y., Chen, Z., Han, X., Liu, K., & Ye, Q. (2026). EPO: Diverse and Realistic Protein Ensemble Generation via Energy Preference Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 40(2), 1060-1068. https://doi.org/10.1609/aaai.v40i2.37076

Issue

Section

AAAI Technical Track on Application Domains II