Any-dimensional equivariant neural networks

Eitan Levin; Mateo Díaz

Any-dimensional equivariant neural networks

Eitan Levin, Mateo Díaz

TL;DR

The paper addresses learning mappings defined on inputs of arbitrary dimension by leveraging representation stability to construct free, dimension-free equivariant neural networks that extend from a fixed training dimension to all dimensions. It shows that equivariant linear layers stabilize across dimensions, enabling finite parameterizations for groups such as permutation, orthogonal, and Lorentz-like actions. To mitigate poor cross-dimension generalization, it introduces a compatibility condition that enforces consistent extensions across dimensions and develops a computational recipe to train networks at a single level and extend them to higher or lower dimensions. Numerical experiments demonstrate improved cross-dimension generalization for compatible networks, and an open-source implementation is provided to facilitate adoption and further research.

Abstract

Traditional supervised learning aims to learn an unknown mapping by fitting a function to a set of input-output pairs with a fixed dimension. The fitted function is then defined on inputs of the same dimension. However, in many settings, the unknown mapping takes inputs in any dimension; examples include graph parameters defined on graphs of any size and physics quantities defined on an arbitrary number of particles. We leverage a newly-discovered phenomenon in algebraic topology, called representation stability, to define equivariant neural networks that can be trained with data in a fixed dimension and then extended to accept inputs in any dimension. Our approach is user-friendly, requiring only the network architecture and the groups for equivariance, and can be combined with any training procedure. We provide a simple open-source implementation of our methods and offer preliminary numerical experiments.

Any-dimensional equivariant neural networks

TL;DR

Abstract

Paper Structure (25 sections, 5 theorems, 14 equations, 2 figures, 2 tables, 1 algorithm)

This paper contains 25 sections, 5 theorems, 14 equations, 2 figures, 2 tables, 1 algorithm.

INTRODUCTION
Core contributions.
Notation.
Outline.
Related work
Equivariant learning.
Dimension-free learning.
Representation stability.
FREE NEURAL NETWORKS
GENERALIZATION ACROSS DIMENSIONS AND COMPATIBILITY
Compatibility in the wild.
Compatible linear layers.
Compatible activation functions.
A COMPUTATIONAL RECIPE FOR LEARNING FREE NEURAL NETWORKS
NUMERICAL EXPERIMENTS
...and 10 more sections

Key Result

Proposition 2.3

If $\{{\bm V}\}$ is generated in degree $d$, then the orthogonal projections $\mathcal{P}_{{\bm V}}\colon ({\bm V}^+)^{{\bm G}^+}\to {\bm V}^{{\bm G}}$ are injective for all $n\geq d$, and isomorphisms for all large $n$.

Figures (2)

Figure 1: Equivariant free neural networks. We use bold font to denote the $n$th element in a sequence; see Notation in Section \ref{['par:notation']} for details.
Figure 2: Test errors across dimensions for free and compatible networks. Each experiment is run three times; the lighter bands show the max and min runs, while the bold line shows the average. For diagonal extraction and symmetric projection, we measure the average MSE per entry. Vertical gray lines mark the dimension used for learning.

Theorems & Definitions (18)

Definition 2.1: Consistent sequences
Definition 2.2: Generation degree
Proposition 2.3: Isomorphism of invariants
proof
Definition 3.1: Compatible networks
Theorem 3.2
Definition B.1: Stabilizing subgroups
Definition B.2: Modules
Definition B.3: Induction and algebraically free sequences
Example B.4
...and 8 more

Any-dimensional equivariant neural networks

TL;DR

Abstract

Any-dimensional equivariant neural networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (18)