Simple online learning with consistent oracle

Alexander Kozachinskiy; Tomasz Steifer

Simple online learning with consistent oracle

Alexander Kozachinskiy, Tomasz Steifer

TL;DR

The paper investigates online learning when the learner has access to a consistent oracle for a hypothesis class with Littlestone dimension $d$, addressing a computational bottleneck of traditional SOA-based methods. It introduces a simpler algorithm that maintains an evolving set of consistent functions and uses majority voting with pruning to achieve an upper bound of $O(256^d)$ mistakes, along with a modular recursive construction to guarantee progress in Littlestone dimension. A complementary lower bound shows that no algorithm in this model can beat $3^d$ mistakes, establishing a tight exponential gap and clarifying the landscape of possible exponents. The work also discusses implications for finite truth-table hypothesis classes and for recursively enumerable (RER) classes, highlighting the practicality of consistent-oracle-based online learning when ERM/SOA are infeasible.

Abstract

We consider online learning in the model where a learning algorithm can access the class only via the \emph{consistent oracle} -- an oracle, that, at any moment, can give a function from the class that agrees with all examples seen so far. This model was recently considered by Assos et al.~(COLT'23). It is motivated by the fact that standard methods of online learning rely on computing the Littlestone dimension of subclasses, a computationally intractable problem. Assos et al.~gave an online learning algorithm in this model that makes at most $C^d$ mistakes on classes of Littlestone dimension $d$, for some absolute unspecified constant $C > 0$. We give a novel algorithm that makes at most $O(256^d)$ mistakes. Our proof is significantly simpler and uses only very basic properties of the Littlestone dimension. We also show that there exists no algorithm in this model that makes less than $3^d$ mistakes.

Simple online learning with consistent oracle

TL;DR

The paper investigates online learning when the learner has access to a consistent oracle for a hypothesis class with Littlestone dimension

, addressing a computational bottleneck of traditional SOA-based methods. It introduces a simpler algorithm that maintains an evolving set of consistent functions and uses majority voting with pruning to achieve an upper bound of

mistakes, along with a modular recursive construction to guarantee progress in Littlestone dimension. A complementary lower bound shows that no algorithm in this model can beat

mistakes, establishing a tight exponential gap and clarifying the landscape of possible exponents. The work also discusses implications for finite truth-table hypothesis classes and for recursively enumerable (RER) classes, highlighting the practicality of consistent-oracle-based online learning when ERM/SOA are infeasible.

Abstract

mistakes on classes of Littlestone dimension

, for some absolute unspecified constant

. We give a novel algorithm that makes at most

mistakes. Our proof is significantly simpler and uses only very basic properties of the Littlestone dimension. We also show that there exists no algorithm in this model that makes less than

mistakes.

Paper Structure (5 sections, 8 theorems, 15 equations, 3 algorithms)

This paper contains 5 sections, 8 theorems, 15 equations, 3 algorithms.

Introduction
Preliminaries
The model and a simple lower bound
The algorithm
The $3^d$ lower bound

Key Result

Proposition 1

For any hypothesis class $H$ we have $\mathsf{Ldim}(H) \le \log_2(|H|)$.

Theorems & Definitions (17)

Proposition 1
Proposition 2
Remark 1
Theorem 1: Littlestone littlestone1988learning
Remark 2
Proposition 3
proof
Theorem 2
proof : Proof of \ref{['thm_main']}
Definition 1
...and 7 more

Simple online learning with consistent oracle

TL;DR

Abstract

Simple online learning with consistent oracle

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (17)