Deconstructing equivariant representations in molecular systems

Kin Long Kelvin Lee; Mikhail Galkin; Santiago Miret

TL;DR

This work analyzes how equivariant representations in molecular graph models encode information for scalar property prediction on QM9. Using a simple GNN with spherical-harmonic embeddings up to order $L$ and PHATE-based latent-space analyses, the authors find that several non-scalar irreps (notably the vector $l=1$ and tensor $l=2$ orders) are often unused and can degrade performance when included. Pruning these orders (e.g., using $L=[0,3,4,5,6]$) yields substantial gains and a clearer latent structure, suggesting that the set of orders should be treated as a tunable hyperparameter rather than a value to be raised until convergence. The study proposes regularization, targeted pruning, and equivariant pretraining as practical directions for improving the efficiency and utilization of equivariant features in tensor-product-based models, and it provides a methodological framework for diagnosing latent representations in such systems.
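To make the pruning example concrete, the sketch below counts how many spherical-harmonic components each choice of orders carries and builds a mask that drops $l=1$ and $l=2$ from a full embedding. It relies only on the fact that order $l$ has $2l+1$ components. The function names and the assumed layout (a single channel, with orders concatenated in ascending $l$) are illustrative assumptions, not the authors' implementation.

```python
def irrep_dim(l: int) -> int:
    """Number of components of the order-l spherical harmonics: 2l + 1."""
    return 2 * l + 1


def embedding_dim(orders: list[int]) -> int:
    """Total width of an embedding that concatenates the given orders."""
    return sum(irrep_dim(l) for l in orders)


def keep_mask(max_order: int, kept_orders: list[int]) -> list[bool]:
    """Boolean mask over a full embedding (orders 0..max_order, assumed to be
    concatenated in ascending order) keeping only components of kept_orders."""
    kept = set(kept_orders)
    mask: list[bool] = []
    for l in range(max_order + 1):
        mask.extend([l in kept] * irrep_dim(l))
    return mask


full_orders = list(range(7))      # l = 0, 1, ..., 6
pruned_orders = [0, 3, 4, 5, 6]   # example from the text: drop l = 1 and l = 2

full_width = embedding_dim(full_orders)      # (6 + 1)^2 = 49 components
pruned_width = embedding_dim(pruned_orders)  # 49 - 3 - 5 = 41 components
mask = keep_mask(6, pruned_orders)

print(full_width, pruned_width, sum(mask))
```

Under these assumptions, pruning the two orders removes 8 of 49 components per channel, so any performance change reported in the text comes from which orders are kept rather than from a large change in embedding width.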

Abstract

Recent equivariant models have shown significant progress not only in chemical property prediction, but also as surrogates for dynamical simulations of molecules and materials. Many of the top-performing models in this category are built within the framework of tensor products, which preserves equivariance by restricting interactions and transformations to those allowed by symmetry selection rules. Although these representations are a core part of the modeling process, little attention has been paid to understanding what information persists in them and how they behave beyond benchmark metrics. In this work, we report on a set of experiments using a simple equivariant graph convolution model on the QM9 dataset, focusing on correlating quantitative performance with the resulting molecular graph embeddings. Our key finding is that, for a scalar prediction task, many of the irreducible representations are simply ignored during training, specifically those pertaining to vector ($l=1$) and tensor ($l=2$) quantities, an issue that is not necessarily evident in the test metric. We empirically show that removing some unused orders of spherical harmonics improves model performance, correlating with improved latent space structure. Based on these observations, we provide a number of recommendations for future experiments to improve the efficiency and utilization of equivariant features.
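One simple way to check whether an order is being ignored is to look at the magnitude of its block in a learned embedding. The sketch below computes a per-order L2 norm, which is unchanged by rotations because each order transforms under an orthogonal matrix in the real spherical-harmonic basis. This is a hypothetical proxy diagnostic, not the embedding analysis reported in this work; the helper name, the flat single-channel layout, and the toy feature vector are all assumptions for illustration.

```python
import math


def per_order_norms(features: list[float], max_order: int) -> dict[int, float]:
    """Split a flat feature vector into blocks of width 2l + 1 for
    l = 0..max_order (assumed ascending layout) and return each block's L2 norm."""
    expected = (max_order + 1) ** 2  # sum of 2l + 1 for l = 0..max_order
    if len(features) != expected:
        raise ValueError(f"expected {expected} components, got {len(features)}")
    norms: dict[int, float] = {}
    start = 0
    for l in range(max_order + 1):
        width = 2 * l + 1
        block = features[start:start + width]
        norms[l] = math.sqrt(sum(x * x for x in block))
        start += width
    return norms


# Toy embedding with max_order = 2 (1 + 3 + 5 = 9 components); values are made up.
toy = [1.0,                        # l = 0
       0.0, 0.0, 0.0,              # l = 1 (all zeros: an "unused" order)
       3.0, 0.0, 4.0, 0.0, 0.0]    # l = 2
norms = per_order_norms(toy, max_order=2)
print(norms)
```

In practice one would average such norms over many molecules and channels; an order whose norm stays near zero throughout training is a candidate for pruning.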
