ConfRover: Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression
Yuning Shen, Lihao Wang, Huizhuo Yuan, Yan Wang, Bangji Yang, Quanquan Gu
TL;DR
ConfRover introduces an autoregressive framework that jointly learns protein conformational distributions and dynamics from MD trajectories, enabling both time-dependent trajectory generation and time-independent sampling within a single model. The architecture combines a protein-aware encoder, a causal latent trajectory module, and an SE(3) diffusion-based structure decoder to generate continuous-space conformations frame-by-frame. Empirical results on the ATLAS dataset show ConfRover outperforms state-of-the-art trajectory models in capturing dynamic magnitudes and state transitions, while achieving competitive quality in time-independent sampling and enabling interpolation between end states with ConfRover-interp. The approach offers a flexible, scalable path toward unified modeling of protein dynamics that can accelerate exploration of conformational spaces and dynamics, potentially speeding up protein design and understanding of functional motions.
Abstract
Understanding protein dynamics is critical for elucidating their biological functions. The increasing availability of molecular dynamics (MD) data enables the training of deep generative models to efficiently explore the conformational space of proteins. However, existing approaches either fail to explicitly capture the temporal dependencies between conformations or do not support direct generation of time-independent samples. To address these limitations, we introduce ConfRover, an autoregressive model that simultaneously learns protein conformation and dynamics from MD trajectories, supporting both time-dependent and time-independent sampling. At the core of our model is a modular architecture comprising: (i) an encoding layer, adapted from protein folding models, that embeds protein-specific information and conformation at each time frame into a latent space; (ii) a temporal module, a sequence model that captures conformational dynamics across frames; and (iii) an SE(3) diffusion model as the structure decoder, generating conformations in continuous space. Experiments on ATLAS, a large-scale protein MD dataset of diverse structures, demonstrate the effectiveness of our model in learning conformational dynamics and supporting a wide range of downstream tasks. ConfRover is the first model to sample both protein conformations and trajectories within a single framework, offering a novel and flexible approach for learning from protein MD data. Project website: https://bytedance-seed.github.io/ConfRover.
