Nonparametric estimation of conditional probability distributions using a generative approach based on conditional push-forward neural networks
Nicola Rares Franco, Lorenzo Tedesco
TL;DR
This work introduces conditional push-forward neural networks (CPFN), a lightweight, nonparametric generative framework for estimating conditional distributions by learning a stochastic map that samples from Y|X via latent variables. CPFN obviates the need for invertible architectures or adversarial training, instead optimizing a KL-based objective and achieving near-asymptotic consistency with a controllable bias. The approach demonstrates strong empirical performance, often outperforming kernel, tree-based, and other deep learning methods in both univariate and multivariate settings, and shows competitive results on real-world UCI datasets. The results suggest CPFN is a practical, scalable tool for conditional density estimation with broad potential applications in probabilistic forecasting and uncertainty quantification.
Abstract
We introduce conditional push-forward neural networks (CPFN), a generative framework for conditional distribution estimation. Instead of directly modeling the conditional density $f_{Y|X}$, CPFN learns a stochastic map $\varphi=\varphi(x,u)$ such that $\varphi(x,U)$ and $Y|X=x$ follow approximately the same law, with $U$ a suitable random vector of pre-defined latent variables. This enables efficient conditional sampling and straightforward estimation of conditional statistics through Monte Carlo methods. The model is trained via an objective function derived from a Kullback-Leibler formulation, without requiring invertibility or adversarial training. We establish a near-asymptotic consistency result and demonstrate experimentally that CPFN can achieve performance competitive with, or even superior to, state-of-the-art methods, including kernel estimators, tree-based algorithms, and popular deep learning techniques, all while remaining lightweight and easy to train.
