Grounding Continuous Representations in Geometry: Equivariant Neural Fields
David R Wessels, David M Knigge, Samuele Papa, Riccardo Valperga, Sharvaree Vadgama, Efstratios Gavves, Erik J Bekkers
TL;DR
The paper addresses the lack of geometric inductive biases in conditional neural fields by introducing Equivariant Neural Fields (ENFs) that ground latent conditioning in a geometry-informed latent point cloud. ENFs use a cross-attention mechanism conditioned on bi-invariant geometric attributes and enforce locality via a Gaussian window, yielding a steerability property that links transformations in the field to corresponding transformations in the latent. This grounding enables weight sharing over similar local patterns and supports downstream geometric reasoning, demonstrated across reconstruction, image/shape classification, segmentation, climate forecasting, and generative modeling, with code releases provided. The work advances continuous-field representations by reinserting locality and symmetry into CNFs, improving learning efficiency, generalization, and editing capabilities across diverse data modalities.
Abstract
Conditional Neural Fields (CNFs) are increasingly being leveraged as continuous signal representations, by associating each data-sample with a latent variable that conditions a shared backbone Neural Field (NeF) to reconstruct the sample. However, existing CNF architectures face limitations when using this latent downstream in tasks requiring fine-grained geometric reasoning, such as classification and segmentation. We posit that this results from lack of explicit modelling of geometric information (e.g., locality in the signal or the orientation of a feature) in the latent space of CNFs. As such, we propose Equivariant Neural Fields (ENFs), a novel CNF architecture which uses a geometry-informed cross-attention to condition the NeF on a geometric variable--a latent point cloud of features--that enables an equivariant decoding from latent to field. We show that this approach induces a steerability property by which both field and latent are grounded in geometry and amenable to transformation laws: if the field transforms, the latent representation transforms accordingly--and vice versa. Crucially, this equivariance relation ensures that the latent is capable of (1) representing geometric patterns faithfully, allowing for geometric reasoning in latent space, and (2) weight-sharing over similar local patterns, allowing for efficient learning of datasets of fields. We validate these main properties in a range of tasks including classification, segmentation, forecasting, reconstruction and generative modelling, showing clear improvement over baselines with a geometry-free latent space. Code attached to submission https://github.com/Dafidofff/enf-jax. Code for a clean and minimal repo https://github.com/david-knigge/enf-min-jax.
