Energy-based models for atomic-resolution protein conformations
Yilun Du, Joshua Meier, Jerry Ma, Rob Fergus, Alexander Rives
TL;DR
The paper presents an atomic-resolution energy-based model (EBM) for protein conformations trained solely on crystal structures, challenging traditional physics-grounded design potentials. It proposes the Atom Transformer, a Transformer-based energy function that scores 64-atom contexts around rotamers and is trained via maximum-likelihood-like objectives using a rotamer library for sampling. On rotamer recovery benchmarks, the Atom Transformer approaches Rosetta in performance, and ensembles narrow the gap, while analyses show learned energies reflect core/surface burial, residue-size dependencies, and hydrogen-bond networks. This data-driven approach demonstrates that neural energy functions can capture relevant physical principles and high-order interactions, offering a path toward flexible, design-oriented protein energy models and future extensions to broader design tasks.
Abstract
We propose an energy-based model (EBM) of protein conformations that operates at atomic scale. The model is trained solely on crystallized protein data. By contrast, existing approaches for scoring conformations use energy functions that incorporate knowledge of physical principles and features that are the complex product of several decades of research and tuning. To evaluate the model, we benchmark on the rotamer recovery task, the problem of predicting the conformation of a side chain from its context within a protein structure, which has been used to evaluate energy functions for protein design. The model achieves performance close to that of the Rosetta energy function, a state-of-the-art method widely used in protein structure prediction and design. An investigation of the model's outputs and hidden representations finds that it captures physicochemical properties relevant to protein energy.
