Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs

Saro Passaro; C. Lawrence Zitnick

Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs

Saro Passaro, C. Lawrence Zitnick

TL;DR

This work tackles the computational bottleneck of $SO(3)$-equivariant GNNs for 3D data by reducing to $SO(2)$-equivariant convolutions through edge-aligned irreps, yielding $O(L^3)$ complexity instead of $O(L^6)$. It introduces the Equivariant Spherical Channel Network (eSCN), which leverages edge rotations and circular harmonics to perform efficient, Clebsch-Gordan-free convolutions while preserving equivariance. Empirical evaluation on OC20 and OC22 demonstrates state-of-the-art performance in force prediction and relaxed-structure accuracy, with substantial speedups that enable higher angular resolutions (larger $L$) in practice. Overall, the approach provides a scalable pathway for accurate 3D equivariant GNNs with broad implications for chemistry and materials science predictions, by bridging $SO(3)$ and $SO(2)$ formalisms in a computationally efficient framework.

Abstract

Graph neural networks that model 3D data, such as point clouds or atoms, are typically desired to be $SO(3)$ equivariant, i.e., equivariant to 3D rotations. Unfortunately equivariant convolutions, which are a fundamental operation for equivariant networks, increase significantly in computational complexity as higher-order tensors are used. In this paper, we address this issue by reducing the $SO(3)$ convolutions or tensor products to mathematically equivalent convolutions in $SO(2)$ . This is accomplished by aligning the node embeddings' primary axis with the edge vectors, which sparsifies the tensor product and reduces the computational complexity from $O(L^6)$ to $O(L^3)$, where $L$ is the degree of the representation. We demonstrate the potential implications of this improvement by proposing the Equivariant Spherical Channel Network (eSCN), a graph neural network utilizing our novel approach to equivariant convolutions, which achieves state-of-the-art results on the large-scale OC-20 and OC-22 datasets.

Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs

TL;DR

This work tackles the computational bottleneck of

-equivariant GNNs for 3D data by reducing to

-equivariant convolutions through edge-aligned irreps, yielding

complexity instead of

. It introduces the Equivariant Spherical Channel Network (eSCN), which leverages edge rotations and circular harmonics to perform efficient, Clebsch-Gordan-free convolutions while preserving equivariance. Empirical evaluation on OC20 and OC22 demonstrates state-of-the-art performance in force prediction and relaxed-structure accuracy, with substantial speedups that enable higher angular resolutions (larger

) in practice. Overall, the approach provides a scalable pathway for accurate 3D equivariant GNNs with broad implications for chemistry and materials science predictions, by bridging

and

formalisms in a computationally efficient framework.

Abstract

Graph neural networks that model 3D data, such as point clouds or atoms, are typically desired to be

equivariant, i.e., equivariant to 3D rotations. Unfortunately equivariant convolutions, which are a fundamental operation for equivariant networks, increase significantly in computational complexity as higher-order tensors are used. In this paper, we address this issue by reducing the

convolutions or tensor products to mathematically equivalent convolutions in

. This is accomplished by aligning the node embeddings' primary axis with the edge vectors, which sparsifies the tensor product and reduces the computational complexity from

, where

is the degree of the representation. We demonstrate the potential implications of this improvement by proposing the Equivariant Spherical Channel Network (eSCN), a graph neural network utilizing our novel approach to equivariant convolutions, which achieves state-of-the-art results on the large-scale OC-20 and OC-22 datasets.

Paper Structure (27 sections, 2 theorems, 41 equations, 10 figures, 3 tables)

This paper contains 27 sections, 2 theorems, 41 equations, 10 figures, 3 tables.

Introduction
Related work
Machine learning potentials
e3nn networks
Efficient equivariant convolution
SO(2) formulation
Architecture
Message passing
Message aggregation
Output blocks
Experiments
Implementation details
Results
Ablation studies
Discussion
...and 12 more sections

Key Result

Proposition 3.1

The coefficient $\mathbf{C}^{(l_o, m_o)}_{(l_i, m_i), (l_f, 0)}$ is non-zero only if $m_i = \pm m_o$. Moreover $\mathbf{C}^{(l_o, m)}_{(l_i, m), (l_f, 0)} = \mathbf{C}^{(l_o, -m)}_{(l_i, -m), (l_f, 0)}$ and $\mathbf{C}^{(l_o, m)}_{(l_i, -m), (l_f, 0)} = - \mathbf{C}^{(l_o, -m)}_{(l_i, m), (l_f, 0)}$

Figures (10)

Figure 1: We implement two mathematically equivalent equivariant GNNs. e3nn uses the e3nn PyTorch library, whereas eSCN uses our novel approach. Left: GPU memory allocated (%) at training time with resepct to the maximum degree of the spherical harmonics ($L$). Right: time (h) per epoch by $L$. We fix the number of channels to $C=64$ and remove any non-linearity.
Figure 2: Visual representation of the tensor product $\mathbf{x}^{(2)} \otimes^{1}_{2, 1} \mathbf{Y}^{(l)}(\hat{\mathbf{r}})$ between $\mathbf{x} \in \mathbb{R}^5$ and $\mathbf{Y}^{(1)}(\hat{\mathbf{r}}) \in \mathbb{R}^3$. The computational cost of the tensor product can be lowered if the direction $\hat{\mathbf{r}}$ is aligned with the y-axis. The tensor product is then reduced to the multiplication of the green shaded entries. The orange shaded entries are the non-zero entries in the Clebsch-Gordan matrix.
Figure 3: Visual representation of the Clebsch-Gordan matrices $\mathbf{C}^{(1, m_o)}_{(2, m_i), (1, m_f)} \in \mathbb{R}^{5 \times 3 \times 3}$ and $\mathbf{C}^{(2, m_o)}_{(2, m_i), (1, m_f)} \in \mathbb{R}^{5 \times 3 \times 5}$. We illustrate the sparsity of the Clebsch-Gordan matrices as stated in Proposition \ref{['prop:C-G']}.
Figure 4: Illustration of how the spherical harmonics (top) are reduced to a set of circular harmonics (bottom) when $\theta$ is held constant.
Figure 5: Block diagram of the message passing architecture containing the edge embedding block (yellow), the $SO(2)$ blocks (green) and the point-wise non-linearity (red).
...and 5 more figures

Theorems & Definitions (3)

Proposition 3.1
Definition 2.1
Theorem 2.2

Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs

TL;DR

Abstract

Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (3)