Generative vs. Discriminative modeling under the lens of uncertainty quantification

Elouan Argouarc'h; François Desbouvries; Eric Barat; Eiji Kawasaki

Generative vs. Discriminative modeling under the lens of uncertainty quantification

Elouan Argouarc'h, François Desbouvries, Eric Barat, Eiji Kawasaki

TL;DR

This work analyzes generative versus discriminative modeling through the lens of epistemic uncertainty quantified by the posterior predictive distribution ($ppd$). It derives how the two approaches differ in model posteriors, joint distributions, and priors, showing that generative models can leverage explicit priors and unlabeled data, while discriminative models rely on implicit priors and struggle with multi-observation and imbalanced data. A Gibbs-sampling framework is proposed to sample from the $ppd$ for both paradigms, with special emphasis on semi-supervised learning where only the generative approach can exploit unlabeled observations to reduce epistemic uncertainty. Simulations on affine models and neural-network-based image classification substantiate the theoretical claims, highlighting robustness to class imbalance and benefits of semi-supervised learning for generative formulations. The findings provide practical guidance for uncertainty-aware learning, favoring generative approaches in settings with limited labels or abundant unlabeled data, especially when reliable prior information is available or when extrapolation beyond the labeled support is required.

Abstract

Learning a parametric model from a given dataset indeed enables to capture intrinsic dependencies between random variables via a parametric conditional probability distribution and in turn predict the value of a label variable given observed variables. In this paper, we undertake a comparative analysis of generative and discriminative approaches which differ in their construction and the structure of the underlying inference problem. Our objective is to compare the ability of both approaches to leverage information from various sources in an epistemic uncertainty aware inference via the posterior predictive distribution. We assess the role of a prior distribution, explicit in the generative case and implicit in the discriminative case, leading to a discussion about discriminative models suffering from imbalanced dataset. We next examine the double role played by the observed variables in the generative case, and discuss the compatibility of both approaches with semi-supervised learning. We also provide with practical insights and we examine how the modeling choice impacts the sampling from the posterior predictive distribution. With regard to this, we propose a general sampling scheme enabling supervised learning for both approaches, as well as semi-supervised learning when compatible with the considered modeling approach. Throughout this paper, we illustrate our arguments and conclusions using the example of affine regression, and validate our comparative analysis through classification simulations using neural network based models.

Generative vs. Discriminative modeling under the lens of uncertainty quantification

TL;DR

This work analyzes generative versus discriminative modeling through the lens of epistemic uncertainty quantified by the posterior predictive distribution (

). It derives how the two approaches differ in model posteriors, joint distributions, and priors, showing that generative models can leverage explicit priors and unlabeled data, while discriminative models rely on implicit priors and struggle with multi-observation and imbalanced data. A Gibbs-sampling framework is proposed to sample from the

for both paradigms, with special emphasis on semi-supervised learning where only the generative approach can exploit unlabeled observations to reduce epistemic uncertainty. Simulations on affine models and neural-network-based image classification substantiate the theoretical claims, highlighting robustness to class imbalance and benefits of semi-supervised learning for generative formulations. The findings provide practical guidance for uncertainty-aware learning, favoring generative approaches in settings with limited labels or abundant unlabeled data, especially when reliable prior information is available or when extrapolation beyond the labeled support is required.

Abstract

Paper Structure (17 sections, 27 equations, 14 figures, 1 table, 2 algorithms)

This paper contains 17 sections, 27 equations, 14 figures, 1 table, 2 algorithms.

Introduction
Supervised learning: Context and objective
Generative versus Discriminative modeling
Handling multiple observations
Supervised Epistemic Uncertainty via the ppd
model posterior: $p(\theta|y_0,\mathcal{D})$ or $p(\theta|\mathcal{D}) ?$
Joint pdf
The ppd
Explicit or Implicit prior
Classification example
Gibbs sampling from the ppd
Bayesian Semi-Supervised learning
The learning task
Both modeling confronted to the semi-supervised learning task
A Gibbs sampling algorithm for semi-supervised learning
...and 2 more sections

Figures (14)

Figure 1: Supervised learning setting
Figure 2: Illustration of the difference between generative and discriminative modeling approaches (right)
Figure 3: Graphical models compared: Generative (left) versus Discriminative (right). The grey (resp. white) nodes are observed (resp. latent) variables.
Figure 4: Empirical estimate of $p(x_0|\mathcal{D})$ in the discriminative setting: samples are obtained via $y\sim \mathcal{P}_{Y_0}$ (top) followed by $x_0\sim p(x_0|y,\mathcal{D})$ (bottom). This distribution corresponds to a trade-off between $\Pi_{X_0}$ and $\mathcal{P}_X^{\mathcal{D}}$ (right)
Figure 5: Varying degrees of aleatoric uncertainty in the DGP yield the distribution $p(x_0|\mathcal{D})$ to shift between $\Pi_{X_0}$ and $\mathcal{P}_X^{\mathcal{D}}$.
...and 9 more figures

Generative vs. Discriminative modeling under the lens of uncertainty quantification

TL;DR

Abstract

Generative vs. Discriminative modeling under the lens of uncertainty quantification

Authors

TL;DR

Abstract

Table of Contents

Figures (14)