Solomonoff induction

Tom F. Sterkenburg

Solomonoff induction

Tom F. Sterkenburg

Abstract

This chapter discusses the Solomonoff approach to universal prediction. The crucial ingredient in the approach is the notion of computability, and I present the main idea as an attempt to meet two plausible computability desiderata for a universal predictor. This attempt is unsuccessful, which is shown by a generalization of a diagonalization argument due to Putnam. I then critically discuss purported gains of the approach, in particular it providing a foundation for the methodological principle of Occam's razor, and it serving as a theoretical ideal for the development of machine learning methods.

Solomonoff induction

Abstract

Paper Structure (33 sections, 4 theorems, 21 equations)

This paper contains 33 sections, 4 theorems, 21 equations.

Introduction
Universal prediction
Sequential prediction
Universal reliability
Reliability for computability
A universal mixture
Universal optimality
Optimality among all methods
Aggregating predictors
Optimality
The universal aggregator
The diagonal argument
The Solomonoff-Levin approach
Semi-computable semi-measures
Computable functions
...and 18 more sections

Key Result

Theorem 2

For mixture predictor $\textsf{p}^\mathcal{H}_w$, for all $\mu \in \mathcal{H}$, with $\mu$-probability 1, $\textsf{p}^\mathcal{H}_w(\pmb{x}^t,x_{t+1}) \xrightarrow{t \rightarrow \infty} \mu(x_{t+1}\mid \pmb{x}^t).$

Theorems & Definitions (10)

Definition 1: Bayesian mixture
Theorem 2: Bayesian consistency
proof
Theorem 3: Optimality
proof
Theorem 4: Putnam's diagonalization
proof
proof
Theorem 6
proof

Solomonoff induction

Abstract

Solomonoff induction

Authors

Abstract

Table of Contents

Key Result

Theorems & Definitions (10)