Supervised learning with probabilistic morphisms and kernel mean embeddings
Hông Vân Lê
TL;DR
The paper introduces a generative model of supervised learning that unifies density estimation, regression, and conditional probability estimation through the concept of a correct loss function and probabilistic morphisms. It develops a kernel mean embedding framework to characterize regular conditional probabilities as minimizers of mean-squared error in RKHS, enabling concrete instantaneous losses and measurability properties. A generalized Cucker-Smale result is established for learnability of conditional probability estimation via C-ERM algorithms, supported by finite-sample bounds and covering-number analyses. Additionally, a Vapnik-Stefanyuk–type regularization scheme based on inner measure is developed to address stochastic ill-posed problems and prove generalizability for overparameterized models. The work emphasizes inner/outer measure-based consistency notions and extends classical results to a broader, RKHS-enabled setting with practical implications for overparameterized discriminative modeling.
Abstract
In this paper I propose a generative model of supervised learning that unifies two approaches to supervised learning, using a concept of a correct loss function. Addressing two measurability problems, which have been ignored in statistical learning theory, I propose to use convergence in outer probability to characterize the consistency of a learning algorithm. Building upon these results, I extend a result due to Cucker-Smale, which addresses the learnability of a regression model, to the setting of a conditional probability estimation problem. Additionally, I present a variant of Vapnik-Stefanuyk's regularization method for solving stochastic ill-posed problems, and using it to prove the generalizability of overparameterized supervised learning models.
