Robust Learning with Optimal Error

Guy Blanc

Abstract

We construct algorithms with optimal error for learning with adversarial noise. The overarching theme of this work is that the use of \textsl{randomized} hypotheses can substantially improve upon the best error rates achievable with deterministic hypotheses.

  • For $\eta$-rate malicious noise, we show the optimal error is $\frac{1}{2} \cdot \eta/(1-\eta)$, improving on the optimal error of deterministic hypotheses by a factor of $1/2$. This answers an open question of Cesa-Bianchi et al. (JACM 1999), who showed randomness can improve error by a factor of $6/7$.
  • For $\eta$-rate nasty noise, we show the optimal error is $\frac{3}{2} \cdot \eta$ for distribution-independent learners and $\eta$ for fixed-distribution learners, both improving upon the optimal $2\eta$ error of deterministic hypotheses. This closes a gap first noted by Bshouty et al. (Theoretical Computer Science 2002) when they introduced nasty noise, and reiterated in the recent works of Klivans et al. (NeurIPS 2025) and Blanc et al. (SODA 2026).
  • For $\eta$-rate agnostic noise and the closely related nasty classification noise model, we show the optimal error is $\eta$, improving upon the optimal $2\eta$ error of deterministic hypotheses.

All of our learners have sample complexity linear in the VC dimension of the concept class and polynomial in the inverse excess error. All except the fixed-distribution nasty noise learner are time efficient given access to an oracle for empirical risk minimization.
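To make the claimed improvements concrete, here is a small sketch that evaluates the error rates stated above at an illustrative noise rate $\eta = 0.1$ (the formulas are taken directly from the abstract; the specific value of $\eta$ is only an example):

```python
# Optimal error rates stated in the abstract, as functions of the noise rate eta.
# "Randomized" = achievable with randomized hypotheses (this work);
# "deterministic" = the best achievable with deterministic hypotheses.

def malicious_randomized(eta):
    # Optimal error under eta-rate malicious noise: (1/2) * eta / (1 - eta).
    return 0.5 * eta / (1 - eta)

def malicious_deterministic(eta):
    # Deterministic optimum is a factor 2 worse: eta / (1 - eta).
    return eta / (1 - eta)

def nasty_dist_independent(eta):
    # Distribution-independent learning with nasty noise: (3/2) * eta.
    return 1.5 * eta

def nasty_fixed_dist(eta):
    # Fixed-distribution learning with nasty noise: eta.
    return eta

def nasty_deterministic(eta):
    # Deterministic optimum for nasty noise: 2 * eta.
    return 2.0 * eta

def agnostic_randomized(eta):
    # Agnostic / nasty classification noise: eta (vs. 2 * eta deterministically).
    return eta

eta = 0.1
print(f"malicious: {malicious_randomized(eta):.4f} vs {malicious_deterministic(eta):.4f}")
print(f"nasty:     {nasty_dist_independent(eta):.4f} / {nasty_fixed_dist(eta):.4f} vs {nasty_deterministic(eta):.4f}")
```

At $\eta = 0.1$ the randomized malicious-noise learner achieves error about $0.056$ versus $0.111$ deterministically, exhibiting the factor-$1/2$ gap.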

Paper Structure

This paper contains 37 sections, 24 theorems, 186 equations, 6 figures.

Key Result

Theorem 1

For any concept class $\mathcal{C}$ with VC dimension $d$ and $\varepsilon > 0$, there is an algorithm that learns $\mathcal{C}$ with $\eta$-malicious noise and error at most $\frac{1}{2} \cdot \frac{\eta}{1-\eta} + \varepsilon$ using $O(\frac{d}{\varepsilon^2})$ samples. Furthermore, this algorithm
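The sample-complexity bound in Theorem 1 can be sketched numerically as follows. Note that the theorem only fixes the asymptotic rate $O(d/\varepsilon^2)$; the leading constant `C` below is a hypothetical placeholder, not part of the stated result:

```python
import math

def sample_bound(d, eps, C=1.0):
    # m = O(d / eps^2) samples suffice; C is a hypothetical constant,
    # since the theorem only pins down the asymptotic rate.
    return math.ceil(C * d / eps ** 2)

def error_bound(eta, eps):
    # Error achieved by the learner: (1/2) * eta / (1 - eta) + eps.
    return 0.5 * eta / (1 - eta) + eps

print(sample_bound(d=10, eps=0.05))          # 4000 (with C = 1)
print(round(error_bound(eta=0.1, eps=0.05), 4))
```

Halving the excess error $\varepsilon$ quadruples the sample bound, while the achieved error approaches the optimal $\frac{1}{2} \cdot \frac{\eta}{1-\eta}$.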

Figures (6)

  • Figure 1: Our malicious noise learner.
  • Figure 2: Our algorithm meeting the requirements of \ref{thm:general-optimization}
  • Figure 3: Algorithm 2 in J13.
  • Figure 4: The shaded region depicts the constraints of \ref{eq:g-constraints-mal-body}, which our choice of $g$ lies within.
  • Figure 5: Visualization of our construction used in the proof of \ref{thm:nasty-lb-body}.
  • ...and 1 more figure

Theorems & Definitions (86)

  • Theorem 1: Optimal learning with malicious noise, see \ref{thm:malicious-body} for the full version
  • Theorem 2: Optimal distribution-independent learning with nasty noise, see \ref{thm:nasty-dist-free-upper-body} and \ref{thm:nasty-lb-body} for the full versions
  • Theorem 3: Optimal fixed-distribution learning with nasty noise, see \ref{thm:nasty-fixed-dist-body} for the full version
  • Theorem 4: Optimal learning with nasty classification noise, see \ref{thm:agnostic-body} for the formal version
  • Remark 1: Two definitions of error for agnostic noise
  • Claim 2.0: Improper learners are necessary
  • Claim 2.0: Distinct learners are necessary
  • Claim 3.1: Malicious error can be linearized
  • Claim 3.2: A good hypothesis exists for any distribution of concepts
  • Theorem 5: Special case of \ref{thm:general-optimization}
  • ...and 76 more