Table of Contents
Fetching ...

Solomonoff induction

Tom F. Sterkenburg

Abstract

This chapter discusses the Solomonoff approach to universal prediction. The crucial ingredient in the approach is the notion of computability, and I present the main idea as an attempt to meet two plausible computability desiderata for a universal predictor. This attempt is unsuccessful, which is shown by a generalization of a diagonalization argument due to Putnam. I then critically discuss purported gains of the approach, in particular it providing a foundation for the methodological principle of Occam's razor, and it serving as a theoretical ideal for the development of machine learning methods.

Solomonoff induction

Abstract

This chapter discusses the Solomonoff approach to universal prediction. The crucial ingredient in the approach is the notion of computability, and I present the main idea as an attempt to meet two plausible computability desiderata for a universal predictor. This attempt is unsuccessful, which is shown by a generalization of a diagonalization argument due to Putnam. I then critically discuss purported gains of the approach, in particular it providing a foundation for the methodological principle of Occam's razor, and it serving as a theoretical ideal for the development of machine learning methods.
Paper Structure (33 sections, 4 theorems, 21 equations)

This paper contains 33 sections, 4 theorems, 21 equations.

Key Result

Theorem 2

For mixture predictor $\textsf{p}^\mathcal{H}_w$, for all $\mu \in \mathcal{H}$, with $\mu$-probability 1, $\textsf{p}^\mathcal{H}_w(\pmb{x}^t,x_{t+1}) \xrightarrow{t \rightarrow \infty} \mu(x_{t+1}\mid \pmb{x}^t).$

Theorems & Definitions (10)

  • Definition 1: Bayesian mixture
  • Theorem 2: Bayesian consistency
  • proof
  • Theorem 3: Optimality
  • proof
  • Theorem 4: Putnam's diagonalization
  • proof
  • proof
  • Theorem 6
  • proof