Table of Contents
Fetching ...

Lie Group Decompositions for Equivariant Neural Networks

Mircea Mironenco, Patrick Forré

TL;DR

This paper tackles the challenge of designing neural networks equivariant to non-compact, non-abelian Lie groups by introducing a framework that leverages Cartan and Polar decompositions to decompose Haar measure and build global parametrizations between Lie algebras and groups. It replaces the reliance on surjective group exponentials with a principled factorization approach, enabling efficient invariant integration and Lie algebra–based kernel parametrization for groups like GL+(n,R), SL(n,R), and affine G. Kernels are learned in Lie algebra coordinates via a map xi^{-1} and integrated over the group with Monte Carlo methods on the decomposed factors, yielding practical, scalable equivariant layers for affine transformations. Empirical results on affNIST and homNIST show state-of-the-art affine robustness with reduced Monte Carlo sampling, highlighting the method's potential to extend equivariant architectures to a broader class of geometric transformations.

Abstract

Invariance and equivariance to geometrical transformations have proven to be very useful inductive biases when training (convolutional) neural network models, especially in the low-data regime. Much work has focused on the case where the symmetry group employed is compact or abelian, or both. Recent work has explored enlarging the class of transformations used to the case of Lie groups, principally through the use of their Lie algebra, as well as the group exponential and logarithm maps. The applicability of such methods is limited by the fact that depending on the group of interest $G$, the exponential map may not be surjective. Further limitations are encountered when $G$ is neither compact nor abelian. Using the structure and geometry of Lie groups and their homogeneous spaces, we present a framework by which it is possible to work with such groups primarily focusing on the groups $G = \text{GL}^{+}(n, \mathbb{R})$ and $G = \text{SL}(n, \mathbb{R})$, as well as their representation as affine transformations $\mathbb{R}^{n} \rtimes G$. Invariant integration as well as a global parametrization is realized by a decomposition into subgroups and submanifolds which can be handled individually. Under this framework, we show how convolution kernels can be parametrized to build models equivariant with respect to affine transformations. We evaluate the robustness and out-of-distribution generalisation capability of our model on the benchmark affine-invariant classification task, outperforming previous proposals.

Lie Group Decompositions for Equivariant Neural Networks

TL;DR

This paper tackles the challenge of designing neural networks equivariant to non-compact, non-abelian Lie groups by introducing a framework that leverages Cartan and Polar decompositions to decompose Haar measure and build global parametrizations between Lie algebras and groups. It replaces the reliance on surjective group exponentials with a principled factorization approach, enabling efficient invariant integration and Lie algebra–based kernel parametrization for groups like GL+(n,R), SL(n,R), and affine G. Kernels are learned in Lie algebra coordinates via a map xi^{-1} and integrated over the group with Monte Carlo methods on the decomposed factors, yielding practical, scalable equivariant layers for affine transformations. Empirical results on affNIST and homNIST show state-of-the-art affine robustness with reduced Monte Carlo sampling, highlighting the method's potential to extend equivariant architectures to a broader class of geometric transformations.

Abstract

Invariance and equivariance to geometrical transformations have proven to be very useful inductive biases when training (convolutional) neural network models, especially in the low-data regime. Much work has focused on the case where the symmetry group employed is compact or abelian, or both. Recent work has explored enlarging the class of transformations used to the case of Lie groups, principally through the use of their Lie algebra, as well as the group exponential and logarithm maps. The applicability of such methods is limited by the fact that depending on the group of interest , the exponential map may not be surjective. Further limitations are encountered when is neither compact nor abelian. Using the structure and geometry of Lie groups and their homogeneous spaces, we present a framework by which it is possible to work with such groups primarily focusing on the groups and , as well as their representation as affine transformations . Invariant integration as well as a global parametrization is realized by a decomposition into subgroups and submanifolds which can be handled individually. Under this framework, we show how convolution kernels can be parametrized to build models equivariant with respect to affine transformations. We evaluate the robustness and out-of-distribution generalisation capability of our model on the benchmark affine-invariant classification task, outperforming previous proposals.
Paper Structure (44 sections, 30 theorems, 114 equations, 2 figures, 2 tables)

This paper contains 44 sections, 30 theorems, 114 equations, 2 figures, 2 tables.

Key Result

Theorem 4.1

(1) Let $G$ be a locally compact group, $H \leq G$ a closed subgroup, with left Haar measures $\mu_{G}$ and $\mu_{H}$ respectively. There is a $G$-invariant Radon measure $\mu_{G/H}$ on $G/H$ if and only if $\left.\Delta_{G}\right|_{H} = \Delta_{H}$. The measure $\mu_{G / H}$ is unique up to a scala (2) Let $P \leq G$, $K \leq G$ closed subgroups such that $G = PK$. Assume that $P \cap K$ is compa

Figures (2)

  • Figure 1: Equivariance error as a function of the number of MC samples.
  • Figure 2: Test error on affNIST/homNIST as a function of MC samples.

Theorems & Definitions (46)

  • Theorem 4.1
  • proof
  • Theorem 4.2
  • Theorem 4.3
  • Theorem 4.4
  • Theorem A.1
  • Remark A.2
  • Proposition A.3
  • Proposition A.4
  • Proposition B.1
  • ...and 36 more