Table of Contents
Fetching ...

A categorical characterization of relative entropy on standard Borel spaces

Nicolas Gagne, Prakash Panangaden

TL;DR

The paper extends a categorical treatment of relative entropy from finite sets to standard Borel spaces by leveraging the Giry monad and Rohlin disintegration. It defines a relative-entropy functor $RE:\mathbf{SbStat}\to[0,\infty]$ and proves its convex linearity, lower semicontinuity, and a uniqueness result up to a scalar, thereby generalizing the finite-case $RE_{fin}$ to the measure-theoretic setting. It further shows that $RE$ coincides with the Kullback–Leibler divergence on absolutely coherent morphisms and provides a principled foundation for entropy-based reasoning in learning and Bayesian inversion on standard Borel spaces. This framework clarifies how entropy can be incorporated into learning-theoretic constructions beyond finite domains and sets the stage for future point-free extensions.

Abstract

We give a categorical treatment, in the spirit of Baez and Fritz, of relative entropy for probability distributions defined on standard Borel spaces. We define a category suitable for reasoning about statistical inference on standard Borel spaces. We define relative entropy as a functor into Lawvere's category and we show convexity, lower semicontinuity and uniqueness.

A categorical characterization of relative entropy on standard Borel spaces

TL;DR

The paper extends a categorical treatment of relative entropy from finite sets to standard Borel spaces by leveraging the Giry monad and Rohlin disintegration. It defines a relative-entropy functor and proves its convex linearity, lower semicontinuity, and a uniqueness result up to a scalar, thereby generalizing the finite-case to the measure-theoretic setting. It further shows that coincides with the Kullback–Leibler divergence on absolutely coherent morphisms and provides a principled foundation for entropy-based reasoning in learning and Bayesian inversion on standard Borel spaces. This framework clarifies how entropy can be incorporated into learning-theoretic constructions beyond finite domains and sets the stage for future point-free extensions.

Abstract

We give a categorical treatment, in the spirit of Baez and Fritz, of relative entropy for probability distributions defined on standard Borel spaces. We define a category suitable for reasoning about statistical inference on standard Borel spaces. We define relative entropy as a functor into Lawvere's category and we show convexity, lower semicontinuity and uniqueness.

Paper Structure

This paper contains 11 sections, 12 theorems, 61 equations.

Key Result

Theorem 2.3

Let $(X,p)$ and $(Y,q)$ be two standard Borel spaces equipped with probability measures, where $q$ is the pushforward measure $q := p \circ f^{-1}$ for a Borel measurable function $f: X \rightarrow Y$. Then, there exists a $q$-almost everywhere uniquely determined family of probability measures $\{p

Theorems & Definitions (32)

  • Definition 2.1
  • Definition 2.2
  • Theorem 2.3: Rokhlin49
  • Proposition 2.4
  • Definition 3.1
  • Definition 3.2
  • Definition 3.3
  • Proposition 3.4
  • proof
  • Definition 3.5
  • ...and 22 more