On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

Joaquín Sánchez García

On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

Joaquín Sánchez García

TL;DR

This work introduces measure pre-conditioning as a principled approach to modify training data distributions in order to improve convergence and computational efficiency of general parametric ML models, while preserving the limiting problem. It leverages Gamma-convergence and a full learner recovery framework to guarantee that minimizers under pre-conditioned measures converge to the true optimizer, enabling stable domain adaptation and transfer learning. The paper surveys a spectrum of non-parametric, measure-based pre-conditioners (e.g., Wasserstein barycenters, entropic and class-based barycenters, MMD-regularized measures) and connects them to empirical density estimation, OT-based domain adaptation, and WGANs, with theoretical results and numerical demonstrations on tasks like MNIST with Gaussian blur and conditional average-guess adaptation. The framework provides practical recipes for selecting measures, discusses convergence properties under various topologies, and outlines directions for extending measure pre-conditioning to broader ML settings and transfer-learning scenarios.

Abstract

We study a new technique for understanding convergence of learning agents under small modifications of data. We show that such convergence can be understood via an analogue of Fatou's lemma which yields gamma-convergence. We show it's relevance and applications in general machine learning tasks and domain adaptation transfer learning.

On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

TL;DR

Abstract

Paper Structure (57 sections, 22 theorems, 68 equations, 11 figures)

This paper contains 57 sections, 22 theorems, 68 equations, 11 figures.

Introduction
Organization of this document
Relation to literature
Necessity of non-parametric measure pre-conditioning techniques
Measure pre-conditionings
A mathematical framework admitting pre-conditioning
Formulation of the problem
Convergence of the learning problem
The main question
Main Theorem
A version of the envelope Theorem
No Empirical Probability Measure can Converge in the Total Variation Sense for all Distributions
Example: Linear regression
Measure pre-conditioning approaches
Background and Notation
...and 42 more sections

Key Result

Proposition 1

(Standard convergence results on total loss (not minimizers))

Figures (11)

Figure 1: Unblurred, $\sigma = 0$
Figure 2: Blurred, $\sigma = 0.5$
Figure 3: Blurred, $\sigma = 1$
Figure 4: Loss function during training
Figure 5: Accuracy of the model during training
...and 6 more figures

Theorems & Definitions (55)

Proposition 1
Theorem 2
Definition 1
Remark 1
Theorem 3
Theorem 4
Definition 2
Remark 2
Theorem 5
proof
...and 45 more

On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

TL;DR

Abstract

On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (55)