Reproducing kernel methods for machine learning, PDEs, and statistics

Philippe G. LeFloch; Jean-Marc Mercier; Shohruh Miryusupov

Reproducing kernel methods for machine learning, PDEs, and statistics

Philippe G. LeFloch, Jean-Marc Mercier, Shohruh Miryusupov

TL;DR

This work develops a unified RKHS framework fused with optimal transport to advance kernel methods across ML, PDEs, and finance. It builds a theory stack—discrete/continuous RKHS, kernel engineering, and discrete kernel operators—then leverages OT concepts (Monge/Kantorovich, GW/GM, Sinkhorn) to design scalable, mesh-free discretizations and generative models. Across applications, it demonstrates kernel ridge regression and related RKHS tools for regression, classification, clustering, and generation, and shows how OT-inspired maps yield sample-efficient, data-driven models with reproducibility guarantees. The practical impact lies in reproducible benchmarks, scalable kernels, and a cohesive pathway from kernel theory to industrial computational physics and mathematical finance tasks. The compilation also provides CodPy-based tooling and a companion site with runnable Python code to diffusion kernel-based RKHS methods.

Abstract

This monograph develops a unified, application-driven framework for kernel methods grounded in reproducing kernel Hilbert spaces (RKHS) and optimal transport (OT). Part I lays the theoretical and numerical foundations on positive-definite kernels; discrete and continuous RKHS; kernel engineering and scaling maps; error assessment via kernel discrepancy/maximum mean discrepancy (MMD); and a systematic operator view of kernels. In this viewpoint, projection, gradient, divergence, and Laplace-Beltrami operators are built directly from kernels, enabling discrete analogues of differential operators and variational tools that connect learning with PDE-style modeling. Part II turns to practice across four domains. In machine learning, we treat supervised and unsupervised tasks, then develop RKHS-based generative modeling, contrasting density and projection approaches and enhancing them with OT and scalable, combinatorial assignments. We introduce clustering strategies that reduce computational burden and support large-scale regression and transport. In physics-informed modeling, we present mesh-free kernel discretizations for elliptic and time-dependent PDEs, discuss automatic differentiation, and propose high-order discrete approximations. In reinforcement learning, we formulate kernel Q-learning and non-parametric HJB methods, and show how kernel operators yield sample-efficient baselines on continuous-state, discrete-action tasks. In mathematical finance, we build nonparametric time-series models and market generators, study benchmarking and extrapolation for pricing, and apply the framework to stress testing and portfolio methods.

Reproducing kernel methods for machine learning, PDEs, and statistics

TL;DR

Abstract

Paper Structure (351 sections, 3 theorems, 336 equations, 89 figures, 17 tables, 6 algorithms)

This paper contains 351 sections, 3 theorems, 336 equations, 89 figures, 17 tables, 6 algorithms.

Introduction
Main objective
Outline of this monograph
About this work
Acknowledgments
A framework based on reproducing kernels and optimal transport
Fundamental notions on reproducing kernels
Discrete reproducing kernel Hilbert spaces
A definition of positive-definite kernels
Kernel-based approximations
Error estimates based on kernel discrepancy
Continuous reproducing kernel Hilbert spaces
Discrete versus continuous
Bochner theorem and universal kernels
Mercer theorem
...and 336 more sections

Key Result

Theorem 2.1

Let $\mathcal{H}$ be a Hilbert space of functions. If $k(\cdot,\cdot)$ is a positive-definite kernel on $\mathcal{H}$, then where $e_j(x)$ is an orthonormal basis of $\mathcal{H}$ defined through the relation

Figures (89)

Figure 2. 1: A list of kernels
Figure 2. 2: Kernels transformed with mappings
Figure 2. 3: Training set $(x,f(x))$ (left) and test set $(z,f(z))$ (right)
Figure 2. 4: A qualitative comparison between kernels
Figure 2. 5: Periodic function extrapolation test with KRR, SVR, FFN, DT, Adaboost, XGBoost, RF
...and 84 more figures

Theorems & Definitions (3)

Theorem 2.1
Theorem 2.2
Theorem 2.3

Reproducing kernel methods for machine learning, PDEs, and statistics

TL;DR

Abstract

Reproducing kernel methods for machine learning, PDEs, and statistics

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (89)

Theorems & Definitions (3)