Table of Contents
Fetching ...

Pseudo-Nonlinear Data Augmentation: A Constrained Energy Minimization Viewpoint

Pingbang Hu, Mahito Sugiyama

TL;DR

The paper targets data augmentation in regimes with limited labels by avoiding training generative models. It proposes a learning-free, energy-based augmentation framework built on information-geometric insights, modeling data as distributions on a curved manifold $\mathcal{S}$ and performing controllable augmentation via forward projection to a base sub-manifold $\mathcal{B}$ and backward projection from a local data sub-manifold $\mathcal{D}$. The approach relies on a log-linear model on posets to encode structured data, and uses dually-flat coordinates $(\theta, \eta)$ to enable efficient convex projections, with many-body sub-manifolds $\mathcal{M}_{\ell}$ controlling augmentation freedom. Empirical results across image, audio, and tabular datasets show competitive or superior performance compared to baselines, along with improved controllability and reduced variance on challenging small datasets. The method offers a scalable, interpretable alternative to learning-based augmentation, applicable broadly across modalities.

Abstract

We propose a simple yet novel data augmentation method for general data modalities based on energy-based modeling and principles from information geometry. Unlike most existing learning-based data augmentation methods, which rely on learning latent representations with generative models, our proposed framework enables an intuitive construction of a geometrically aware latent space that represents the structure of the data itself, supporting efficient and explicit encoding and decoding procedures. We then present and discuss how to design latent spaces that will subsequently control the augmentation with the proposed algorithm. Empirical results demonstrate that our data augmentation method achieves competitive performance in downstream tasks compared to other baselines, while offering fine-grained controllability that is lacking in the existing literature.

Pseudo-Nonlinear Data Augmentation: A Constrained Energy Minimization Viewpoint

TL;DR

The paper targets data augmentation in regimes with limited labels by avoiding training generative models. It proposes a learning-free, energy-based augmentation framework built on information-geometric insights, modeling data as distributions on a curved manifold and performing controllable augmentation via forward projection to a base sub-manifold and backward projection from a local data sub-manifold . The approach relies on a log-linear model on posets to encode structured data, and uses dually-flat coordinates to enable efficient convex projections, with many-body sub-manifolds controlling augmentation freedom. Empirical results across image, audio, and tabular datasets show competitive or superior performance compared to baselines, along with improved controllability and reduced variance on challenging small datasets. The method offers a scalable, interpretable alternative to learning-based augmentation, applicable broadly across modalities.

Abstract

We propose a simple yet novel data augmentation method for general data modalities based on energy-based modeling and principles from information geometry. Unlike most existing learning-based data augmentation methods, which rely on learning latent representations with generative models, our proposed framework enables an intuitive construction of a geometrically aware latent space that represents the structure of the data itself, supporting efficient and explicit encoding and decoding procedures. We then present and discuss how to design latent spaces that will subsequently control the augmentation with the proposed algorithm. Empirical results demonstrate that our data augmentation method achieves competitive performance in downstream tasks compared to other baselines, while offering fine-grained controllability that is lacking in the existing literature.
Paper Structure (50 sections, 6 equations, 14 figures, 5 tables, 2 algorithms)

This paper contains 50 sections, 6 equations, 14 figures, 5 tables, 2 algorithms.

Figures (14)

  • Figure 1: Given structured data, we first design a poset $\Omega$ that reflects the structure or the relationship between features. The resulting real-valued poset is then embedded into the statistical manifold $\mathcal{S}$ as a discrete probability distribution $p_\theta (x)$ via an embedding $\varphi$. Finally, the log-linear model on posets provides the dually-flat coordinates $(\theta , \eta )$ for $p_{\theta }$, which can be computed efficiently (\ref{['subsec:statistical-manifold-on-posets']}).
  • Figure 2: Natural poset structure of $3^{\text{rd} }$-order tensors in $\mathbb{R} ^{3 \times 3 \times 3}$.
  • Figure 3: Illustration of forward and backward projection. Here, $w_i$: latent representation of the original data $z_i$, obtained from forward projection to $\mathcal{B}$; $w^{\ast}$: augmented latent representation; $w^{\ast} \mapsto z^{\prime\ast}$: backward projection to $\mathcal{D}$, obtained from the original data of the nearest neighbor(s) of $w^{\ast}$ in the latent space.
  • Figure 4: Mode interaction and approximation.
  • Figure 5: Interpolation energy in different geometries.
  • ...and 9 more figures

Theorems & Definitions (7)

  • Remark 3.1
  • Remark 3.2
  • Example 4.1: Positive tensor
  • Remark 4.2: Invariance
  • Remark 4.3: Positive tensor
  • Example A.1: Many-body approximation
  • Remark C.1