TEMPO: Temporal Multi-scale Autoregressive Generation of Protein Conformational Ensembles
Yaoyao Xu, Di Wang, Zihan Zhou, Tianshu Yu, Mingchen Chen
TL;DR
TEMPO addresses the challenge of generating temporally coherent, physically plausible protein dynamics by introducing a two-scale autoregressive framework that couples slow, global conformational transitions with fast, local fluctuations through stochastic differential equations. A spatiotemporal encoder with SE(3)–equivariant representation and invariant attention enables stable, physics-informed trajectory generation, while hierarchical inference anchors detailed dynamics to coarse-grained states. Across mdCATH and ATLAS, TEMPO achieves state-of-the-art performance on structural and thermodynamic metrics, significantly outperforming baselines in trajectory fidelity and efficiency (approximately 22 seconds to generate 400 frames on a single A100). These results demonstrate the practicality of trajectory-aware, multi-scale generative modeling for protein dynamics, offering a scalable alternative to MD with strong biomechanical realism. TEMPO’s framework also provides a foundation for future extensions to side-chain dynamics, ligand binding, and multi-molecule systems.
Abstract
Understanding the dynamic behavior of proteins is critical to elucidating their functional mechanisms, yet generating realistic, temporally coherent trajectories of protein ensembles remains a significant challenge. In this work, we introduce a novel hierarchical autoregressive framework for modeling protein dynamics that leverages the intrinsic multi-scale organization of molecular motions. Unlike existing methods that focus on generating static conformational ensembles or treat dynamic sampling as an independent process, our approach characterizes protein dynamics as a Markovian process. The framework employs a two-scale architecture: a low-resolution model captures slow, collective motions driving major conformational transitions, while a high-resolution model generates detailed local fluctuations conditioned on these large-scale movements. This hierarchical design ensures that the causal dependencies inherent in protein dynamics are preserved, enabling the generation of temporally coherent and physically realistic trajectories. By bridging high-level biophysical principles with state-of-the-art generative modeling, our approach provides an efficient framework for simulating protein dynamics that balances computational efficiency with physical accuracy.
