Table of Contents
Fetching ...

Revisiting Agnostic Boosting

Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice, Yuxin Sun

TL;DR

The paper addresses agnostic boosting for binary classification by proposing a three-stage approach that combines a reduction to realizable boosting, a margin-based pruning of hypotheses, and a validation-based extraction of a final classifier. It proves a near-optimal sample complexity bound that interpolates between agnostic and realizable settings and provides a matching lower bound up to logarithmic factors. The key ideas include relabeling data across a reference class, a margin-based filtering step to shrink an exponentially large hypothesis set to a logarithmic size, and a final selection via a third data split with rigorous generalization guarantees. Overall, the work advances theoretical understanding of agnostic boosting and points toward future work on computationally efficient implementations and tighter logarithmic-factor removal.

Abstract

Boosting is a key method in statistical learning, allowing for converting weak learners into strong ones. While well studied in the realizable case, the statistical properties of weak-to-strong learning remain less understood in the agnostic setting, where there are no assumptions on the distribution of the labels. In this work, we propose a new agnostic boosting algorithm with substantially improved sample complexity compared to prior works under very general assumptions. Our approach is based on a reduction to the realizable case, followed by a margin-based filtering of high-quality hypotheses. Furthermore, we show a nearly-matching lower bound, settling the sample complexity of agnostic boosting up to logarithmic factors.

Revisiting Agnostic Boosting

TL;DR

The paper addresses agnostic boosting for binary classification by proposing a three-stage approach that combines a reduction to realizable boosting, a margin-based pruning of hypotheses, and a validation-based extraction of a final classifier. It proves a near-optimal sample complexity bound that interpolates between agnostic and realizable settings and provides a matching lower bound up to logarithmic factors. The key ideas include relabeling data across a reference class, a margin-based filtering step to shrink an exponentially large hypothesis set to a logarithmic size, and a final selection via a third data split with rigorous generalization guarantees. Overall, the work advances theoretical understanding of agnostic boosting and points toward future work on computationally efficient implementations and tighter logarithmic-factor removal.

Abstract

Boosting is a key method in statistical learning, allowing for converting weak learners into strong ones. While well studied in the realizable case, the statistical properties of weak-to-strong learning remain less understood in the agnostic setting, where there are no assumptions on the distribution of the labels. In this work, we propose a new agnostic boosting algorithm with substantially improved sample complexity compared to prior works under very general assumptions. Our approach is based on a reduction to the realizable case, followed by a margin-based filtering of high-quality hypotheses. Furthermore, we show a nearly-matching lower bound, settling the sample complexity of agnostic boosting up to logarithmic factors.

Paper Structure

This paper contains 20 sections, 27 theorems, 204 equations.

Key Result

Theorem 1.2

There exist universal constants $C_1, C_2, C_3, C_{4} > 0$ for which the following holds. Given any $L \in (0,1)$, any $\gamma, \varepsilon_0, \delta_0 \in (0, 1]$, and any integer $d \ge C_1 \ln(1/\gamma^{2})$, for $m_0 = \Ceil[]{C_2 d \ln\Par{\frac{1}{\delta_{0}\gamma^{2}}} / \Par{\varepsilon_{0}^

Theorems & Definitions (43)

  • Definition 1.1: Agnostic Weak Learner
  • Theorem 1.2
  • Theorem 1.3
  • Lemma 3.1: Realizable Learning Gaurantee of \ref{['alg:adaboost']}
  • Lemma 3.2
  • Lemma 3.3
  • Theorem 3.4
  • Definition A.1: Agnostic Weak Learner of Ben-DavidLM01
  • Definition A.2: Agnostic Weak Learner used in this article
  • Definition A.3: Agnostic Weak Learner KalaiK09
  • ...and 33 more