Table of Contents
Fetching ...

Measure Theory of Conditionally Independent Random Function Evaluation

Felix Benning

TL;DR

The paper develops a rigorous measure-theoretic framework for evaluating random functions at random inputs, addressing when previsible or conditionally independent inputs can be treated as deterministic in conditional calculations. It introduces joint probability kernels for collections of conditional distributions, proves existence and continuity of these kernels under suitable regularity, and provides Gaussian-process special cases with explicit, continuous forms. By relating $\mathbb{E}[f(X)|F]$ to a measurable function $H$ and establishing continuity in the index, the work justifies common heuristics used in Bayesian optimization, Kriging, and related fields, including scenarios with noisy observations. A topological foundation ensures the evaluation map $e(f,x)=f(x)$ is measurable on $C(X,Y)$ via the compact-open topology, clarifying when these constructions are well-defined and highlighting limitations when $X$ lacks local compactness.

Abstract

The next evaluation point $x_{n+1}$ of a random function $\mathbf f = (\mathbf f(x))_{x\in \mathbb X}$ (a.k.a. stochastic process or random field) is often chosen based on the filtration of previously seen evaluations $\mathcal F_n := σ(\mathbf f(x_0),\dots, \mathbf f(x_n))$. This turns $x_{n+1}$ into a random variable $X_{n+1}$ and thereby $\mathbf f(X_{n+1})$ into a complex measure theoretical object. In applications, like geostatistics or Bayesian optimization, the evaluation locations $X_n$ are often treated as deterministic during the calculation of the conditional distribution $\mathbb P(\mathbf f(X_{n+1}) \in A \mid \mathcal F_n)$. We provide a framework to prove that the results obtained by this treatment are typically correct. We also treat the more general case where $X_{n+1}$ is not 'previsible' but independent from $\mathbf f$ conditional on $\mathcal F_n$ and the case of noisy evaluations.

Measure Theory of Conditionally Independent Random Function Evaluation

TL;DR

The paper develops a rigorous measure-theoretic framework for evaluating random functions at random inputs, addressing when previsible or conditionally independent inputs can be treated as deterministic in conditional calculations. It introduces joint probability kernels for collections of conditional distributions, proves existence and continuity of these kernels under suitable regularity, and provides Gaussian-process special cases with explicit, continuous forms. By relating to a measurable function and establishing continuity in the index, the work justifies common heuristics used in Bayesian optimization, Kriging, and related fields, including scenarios with noisy observations. A topological foundation ensures the evaluation map is measurable on via the compact-open topology, clarifying when these constructions are well-defined and highlighting limitations when lacks local compactness.

Abstract

The next evaluation point of a random function (a.k.a. stochastic process or random field) is often chosen based on the filtration of previously seen evaluations . This turns into a random variable and thereby into a complex measure theoretical object. In applications, like geostatistics or Bayesian optimization, the evaluation locations are often treated as deterministic during the calculation of the conditional distribution . We provide a framework to prove that the results obtained by this treatment are typically correct. We also treat the more general case where is not 'previsible' but independent from conditional on and the case of noisy evaluations.

Paper Structure

This paper contains 5 sections, 10 theorems, 52 equations.

Key Result

Theorem 2.3

Let $\mathcal{F}$ be a sub $\sigma$-algebra of the underlying probability space $(\Omega, \mathcal{A}, \mathbb{P})$ and $\mathbf{f}$ a random variable in the space of continuous function $C(\mathbb{X},\mathbb{Y})$, with locally compact, separable metrizable domain $\mathbb{X}$ and Polish co-domain $ for all $\mathcal{F}$-measurable $X$. Furthermore $x\mapsto \kappa(\omega, x; \cdot)$ is continuous

Theorems & Definitions (29)

  • Definition 2.1: Joint probability kernels
  • Example 2.2: A joint conditional distribution that is not consistent
  • Theorem 2.3: Consistency for dependent evaluations $\mathbf{f}(x)$
  • Example 2.4: Conditional minimization
  • Definition 2.5: Previsible
  • Theorem 2.6: Previsible sampling
  • Example 2.7: Gaussian case
  • Corollary 2.8: Gaussian case
  • Theorem 3.1: Consistency for dependent $\mathbf{f}(x)$
  • Remark 3.2: Existence of consistent joint conditional distribution
  • ...and 19 more