Hierarchical geometric deep learning enables scalable analysis of molecular dynamics
Zihan Pengmei, Spencer C. Guo, Chatipat Lorpaiboon, Aaron R. Dinner
TL;DR
This paper tackles the scalability problem of applying geometric GNNs to long-timescale biomolecular dynamics by introducing a token-merging module (TMM) that compresses local structure into fragment-level tokens. Paired with FlashAttention, the approach enables transformer-based nonlocal interaction learning on systems with thousands of residues on a single GPU. Through VAMP and SPIB analyses on ADK and nsp13 datasets, the method demonstrates improved memory efficiency, faster training, and interpretable attention that aligns with known biomolecular motions. The work broadens the applicability of deep learning to complex biomolecular dynamics and provides a scalable, interpretable pipeline for kinetic analysis and MSM construction.
Abstract
Molecular dynamics simulations can generate atomically detailed trajectories of complex systems, but analyzing these dynamics can be challenging when systems lack well-established quantitative descriptors (features). Graph neural networks (GNNs) in which messages are passed between nodes that represent atoms that are spatial neighbors promise to obviate manual feature engineering, but the use of GNNs with biomolecular systems of more than a few hundred residues has been limited in the context of analyzing dynamics by both difficulties in capturing the details of long-range interactions with message passing and the memory and runtime requirements associated with large graphs. Here, we show how local information can be aggregated to reduce memory and runtime requirements without sacrificing atomic detail. We demonstrate that this approach opens the door to analyzing simulations of protein-nucleic acid complexes with thousands of residues on single GPUs within minutes. For systems with hundreds of residues, for which there are sufficient data to make quantitative comparisons, we show that the approach improves performance and interpretability.
