Pseudo-Nonlinear Data Augmentation: A Constrained Energy Minimization Viewpoint

Pingbang Hu; Mahito Sugiyama

TL;DR

The paper targets data augmentation in limited-label regimes without training a generative model. It proposes a learning-free, energy-based augmentation framework built on information geometry: data are modeled as distributions on a curved manifold $\mathcal{S}$, and controllable augmentation is performed via forward projection to a base sub-manifold $\mathcal{B}$ followed by backward projection to a local data sub-manifold $\mathcal{D}$. The approach relies on a log-linear model on posets to encode structured data and uses dually-flat coordinates $(\theta, \eta)$ to enable efficient convex projections, with many-body sub-manifolds $\mathcal{M}_{\ell}$ controlling the degree of augmentation freedom. Empirical results across image, audio, and tabular datasets show performance competitive with or superior to baselines, along with improved controllability and reduced variance on challenging small datasets. The method offers a scalable, interpretable alternative to learning-based augmentation that applies across modalities.

Abstract

We propose a simple yet novel data augmentation method for general data modalities based on energy-based modeling and principles from information geometry. Unlike most existing learning-based data augmentation methods, which rely on learning latent representations with generative models, our proposed framework enables an intuitive construction of a geometrically aware latent space that represents the structure of the data itself, supporting efficient and explicit encoding and decoding procedures. We then present and discuss how to design latent spaces that will subsequently control the augmentation with the proposed algorithm. Empirical results demonstrate that our data augmentation method achieves competitive performance in downstream tasks compared to other baselines, while offering fine-grained controllability that is lacking in the existing literature.

Paper Structure (50 sections, 6 equations, 14 figures, 5 tables, 2 algorithms)

Introduction
Related work
Learning-based data augmentation
Learning-free data augmentation
Preliminary
Dually-flatness in information geometry
Statistical manifold on posets
Pseudo-Nonlinear data augmentation
Log-linear model on posets framework
Real-Valued Poset.
Embedding.
Dually-Flat Coordinates.
Forward and backward projection
Forward Projection.
Backward Projection.
...and 35 more sections

Figures (14)

Figure 1: Given structured data, we first design a poset $\Omega$ that reflects the structure or the relationship between features. The resulting real-valued poset is then embedded into the statistical manifold $\mathcal{S}$ as a discrete probability distribution $p_\theta(x)$ via an embedding $\varphi$. Finally, the log-linear model on posets provides the dually-flat coordinates $(\theta, \eta)$ for $p_{\theta}$, which can be computed efficiently (see the section "Statistical manifold on posets").
Figure 2: Natural poset structure of $3^{\text{rd}}$-order tensors in $\mathbb{R}^{3 \times 3 \times 3}$.
Figure 3: Illustration of forward and backward projection. Here, $w_i$: latent representation of the original data $z_i$, obtained from forward projection to $\mathcal{B}$; $w^{\ast}$: augmented latent representation; $w^{\ast} \mapsto z^{\prime\ast}$: backward projection to $\mathcal{D}$, obtained from the original data of the nearest neighbor(s) of $w^{\ast}$ in the latent space.
Figure 4: Mode interaction and approximation.
Figure 5: Interpolation energy in different geometries.
...and 9 more figures
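
The dually-flat coordinates $(\theta, \eta)$ mentioned in the Figure 1 caption can be illustrated for a positive tensor with the natural index poset of Figure 2. The following is a minimal sketch, not the authors' implementation. It assumes the usual convention for the log-linear model on posets, $\log p(x) = \sum_{s \le x} \theta(s)$ and $\eta(x) = \sum_{s \ge x} p(s)$, with the tensor normalized to sum to one; the paper's own normalization may differ. The function names `theta_eta` and `from_theta` are hypothetical.

```python
import numpy as np


def theta_eta(tensor):
    """Return (theta, eta) of a positive tensor under the natural index poset.

    Assumed convention (may differ from the paper's):
      log p(x) = sum of theta(s) over s <= x
      eta(x)   = sum of p(s)     over s >= x
    where <= is the elementwise order on multi-indices.
    """
    t = np.asarray(tensor, dtype=float)
    if np.any(t <= 0):
        raise ValueError("all entries must be strictly positive")
    p = t / t.sum()  # embed the tensor as a discrete distribution

    # theta: Moebius inversion of log p, i.e. a first difference along every axis.
    theta = np.log(p)
    for axis in range(theta.ndim):
        theta = np.diff(theta, axis=axis, prepend=0.0)

    # eta: sum of p over the upper set, i.e. a reversed cumulative sum along every axis.
    eta = p.copy()
    for axis in range(eta.ndim):
        eta = np.flip(np.cumsum(np.flip(eta, axis=axis), axis=axis), axis=axis)
    return theta, eta


def from_theta(theta):
    """Decode theta back to the normalized distribution p (inverse of the step above)."""
    log_p = np.asarray(theta, dtype=float)
    for axis in range(log_p.ndim):
        log_p = np.cumsum(log_p, axis=axis)
    return np.exp(log_p)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.uniform(0.1, 1.0, size=(3, 3, 3))  # a positive 3rd-order tensor
    theta, eta = theta_eta(x)
    print(np.allclose(from_theta(theta), x / x.sum()))
```

Both coordinate systems are obtained here by cumulative sums and differences, so encoding and decoding are explicit and need no training. Under this convention, the entry of `theta` at the bottom index holds the log-normalizer term, and the entry of `eta` at the bottom index equals one.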

Theorems & Definitions (7)

Remark 3.1
Remark 3.2
Example 4.1: Positive tensor
Remark 4.2: Invariance
Remark 4.3: Positive tensor
Example A.1: Many-body approximation
Remark C.1
