Improved Replicable Boosting with Majority-of-Majorities

Kasper Green Larsen; Markus Engelund Mathiasen; Clement Svendsen

Improved Replicable Boosting with Majority-of-Majorities

Kasper Green Larsen, Markus Engelund Mathiasen, Clement Svendsen

TL;DR

This work tackles the problem of achieving $\rho$-replicability in boosting with favorable data efficiency in the weak-to-strong learning setting. It introduces a two-layer approach: an improved inner replicable learner $rBoost^*$ and a replicable threshold subroutine $rThreshold$, orchestrated by a meta-boosting algorithm $rMetaBoost$ that uses rejection sampling to maintain dense reweightings. The main results establish $\rho$-replicability, $O(\ln(1/\varepsilon)/\gamma^2)$ inner calls to the weak learner, and a total sample complexity of $\tilde{O}\left( \frac{m_{\mathcal{W}(\Theta(\rho\gamma^2))}}{\varepsilon\gamma^2} + \frac{1}{\rho^2\varepsilon\gamma^3} \right)$, improving on prior bounds by reducing dependence on $\varepsilon$ and $\gamma$. A practical replicable threshold check is also introduced, expanding the toolkit for replicable algorithms and enabling more data-efficient replicable boosting in practice.

Abstract

We introduce a new replicable boosting algorithm which significantly improves the sample complexity compared to previous algorithms. The algorithm works by doing two layers of majority voting, using an improved version of the replicable boosting algorithm introduced by Impagliazzo et al. [2022] in the bottom layer.

Improved Replicable Boosting with Majority-of-Majorities

TL;DR

This work tackles the problem of achieving

-replicability in boosting with favorable data efficiency in the weak-to-strong learning setting. It introduces a two-layer approach: an improved inner replicable learner

and a replicable threshold subroutine

, orchestrated by a meta-boosting algorithm

that uses rejection sampling to maintain dense reweightings. The main results establish

-replicability,

inner calls to the weak learner, and a total sample complexity of

, improving on prior bounds by reducing dependence on

and

. A practical replicable threshold check is also introduced, expanding the toolkit for replicable algorithms and enabling more data-efficient replicable boosting in practice.

Improved Replicable Boosting with Majority-of-Majorities

TL;DR

Abstract

Improved Replicable Boosting with Majority-of-Majorities

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (20)