EPO: Diverse and Realistic Protein Ensemble Generation via Energy Preference Optimization
Yuancheng Sun, Yuxuan Ren, Zhaoming Chen, Xu Han, Kang Liu, Qiwei Ye
TL;DR
EPO tackles the difficulty of sampling full Boltzmann protein ensembles by online refinement of pretrained generators using energy-based, listwise preferences. It integrates SDE-based sampling to explore rugged landscapes and adopts a listwise energy ranking with a practical upper bound to align generated ensembles with thermodynamic realism without additional MD trajectories. The approach yields state-of-the-art results across Tetrapeptides, ATLAS, and Fast-Folding benchmarks and demonstrates that energy-only signals can effectively steer generative models toward diverse, physically plausible conformations. This online, MD-free framework potentially broadens the applicability of learned potentials in structural biology and drug discovery by enabling efficient exploration of conformational diversity.
Abstract
Accurate exploration of protein conformational ensembles is essential for uncovering function but remains hard because molecular-dynamics (MD) simulations suffer from high computational costs and energy-barrier trapping. This paper presents Energy Preference Optimization (EPO), an online refinement algorithm that turns a pretrained protein ensemble generator into an energy-aware sampler without extra MD trajectories. Specifically, EPO leverages stochastic differential equation sampling to explore the conformational landscape and incorporates a novel energy-ranking mechanism based on list-wise preference optimization. Crucially, EPO introduces a practical upper bound to efficiently approximate the intractable probability of long sampling trajectories in continuous-time generative models, making it easily adaptable to existing pretrained generators. On Tetrapeptides, ATLAS, and Fast-Folding benchmarks, EPO successfully generates diverse and physically realistic ensembles, establishing a new state-of-the-art in nine evaluation metrics. These results demonstrate that energy-only preference signals can efficiently steer generative models toward thermodynamically consistent conformational ensembles, providing an alternative to long MD simulations and widening the applicability of learned potentials in structural biology and drug discovery.
