Recovering Hidden Degrees of Freedom Using Gaussian Processes

Georg Diez; Nele Dethloff; Gerhard Stock

Recovering Hidden Degrees of Freedom Using Gaussian Processes

Georg Diez, Nele Dethloff, Gerhard Stock

TL;DR

This work tackles the limitation of traditional MD dimensionality reduction methods that ignore temporal structure by introducing a Gaussian Process Variational Autoencoder (GP-VAE) with a time-conditioned latent prior $p(\mathbf{z}|t)$. By employing a Matérn kernel $k_{\nu,\ell}(t,t')$ in the latent space, the method encodes temporal correlations and preserves Markovian dynamics in the reduced representation. The authors demonstrate, first with a 3D toy model and then on a $50\,\mu$s MD trajectory of T4 lysozyme, that GP-VAE can separate dynamically distinct states that are geometrically indistinguishable and reveal functional couplings between structural subunits. This time-aware framework improves the reliability and interpretability of subsequent Markov state model analyses and offers a general approach for uncovering hidden degrees of freedom in complex biomolecular systems.

Abstract

Dimensionality reduction represents a crucial step in extracting meaningful insights from Molecular Dynamics (MD) simulations. Conventional approaches, including linear methods such as principal component analysis as well as various autoencoder architectures, typically operate under the assumption of independent and identically distributed data, disregarding the sequential nature of MD simulations. Here, we introduce a physics-informed representation learning framework that leverages Gaussian Processes combined with variational autoencoders to exploit the temporal dependencies inherent in MD data. Time-dependent kernel functions--such as the Matérn kernel--directly impose the temporal correlation structure of the input coordinates onto a low-dimensional space, preserving Markovianity in the reduced representation while faithfully capturing the essential dynamics. Using a three-dimensional toy model, we demonstrate that this approach can successfully identify and separate dynamically distinct states that are geometrically indistinguishable due to hidden degrees of freedom. Applying the framework to a $50\,μ$s-long MD trajectory of T4 lysozyme, we uncover dynamically distinct conformational substates that previous analyses failed to resolve, revealing functional relationships that become apparent only when temporal correlations are taken into account. This time-aware perspective provides a promising framework for understanding complex biomolecular systems, in which conventional collective variables fail to capture the full dynamical picture.

Recovering Hidden Degrees of Freedom Using Gaussian Processes

TL;DR

. By employing a Matérn kernel

in the latent space, the method encodes temporal correlations and preserves Markovian dynamics in the reduced representation. The authors demonstrate, first with a 3D toy model and then on a

s MD trajectory of T4 lysozyme, that GP-VAE can separate dynamically distinct states that are geometrically indistinguishable and reveal functional couplings between structural subunits. This time-aware framework improves the reliability and interpretability of subsequent Markov state model analyses and offers a general approach for uncovering hidden degrees of freedom in complex biomolecular systems.

Abstract

s-long MD trajectory of T4 lysozyme, we uncover dynamically distinct conformational substates that previous analyses failed to resolve, revealing functional relationships that become apparent only when temporal correlations are taken into account. This time-aware perspective provides a promising framework for understanding complex biomolecular systems, in which conventional collective variables fail to capture the full dynamical picture.

Recovering Hidden Degrees of Freedom Using Gaussian Processes

TL;DR

Abstract

Recovering Hidden Degrees of Freedom Using Gaussian Processes

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)