Learning Noisy Halfspaces with a Margin: Massart is No Harder than Random

Gautam Chandrasekaran; Vasilis Kontonis; Konstantinos Stavropoulos; Kevin Tian

Learning Noisy Halfspaces with a Margin: Massart is No Harder than Random

Gautam Chandrasekaran, Vasilis Kontonis, Konstantinos Stavropoulos, Kevin Tian

TL;DR

This work addresses PAC learning of γ-margin halfspaces under Massart noise η, introducing Perspectron, a simple proper learner that matches the best sample complexity under random classification and extends to Massart GLMs with a known link σ. The core technique combines a certificate-based semi-random noise framework with an inverse-margin reweighting to produce a bounded separating hyperplane, enabling SGD-like updates and cutting-plane refinements. The authors show that ε-accurate learning (_error ≤ η+ε) is achievable with ~O((εγ)^{-2}) samples, and they extend the approach to σ-Massart GLMs with comparable guarantees, improving upon prior results in both models. They discuss limitations, open questions, and note concurrent independent work delivering essentially the same results, highlighting the robustness and potential practical impact of these semi-random-noise learning strategies.

Abstract

We study the problem of PAC learning $γ$-margin halfspaces with Massart noise. We propose a simple proper learning algorithm, the Perspectron, that has sample complexity $\widetilde{O}((εγ)^{-2})$ and achieves classification error at most $η+ε$ where $η$ is the Massart noise rate. Prior works [DGT19,CKMY20] came with worse sample complexity guarantees (in both $ε$ and $γ$) or could only handle random classification noise [DDK+23,KIT+23] -- a much milder noise assumption. We also show that our results extend to the more challenging setting of learning generalized linear models with a known link function under Massart noise, achieving a similar sample complexity to the halfspace case. This significantly improves upon the prior state-of-the-art in this setting due to [CKMY20], who introduced this model.

Learning Noisy Halfspaces with a Margin: Massart is No Harder than Random

TL;DR

Abstract

We study the problem of PAC learning

-margin halfspaces with Massart noise. We propose a simple proper learning algorithm, the Perspectron, that has sample complexity

and achieves classification error at most

where

is the Massart noise rate. Prior works [DGT19,CKMY20] came with worse sample complexity guarantees (in both

and

) or could only handle random classification noise [DDK+23,KIT+23] -- a much milder noise assumption. We also show that our results extend to the more challenging setting of learning generalized linear models with a known link function under Massart noise, achieving a similar sample complexity to the halfspace case. This significantly improves upon the prior state-of-the-art in this setting due to [CKMY20], who introduced this model.

Paper Structure (21 sections, 12 theorems, 50 equations, 1 table)

This paper contains 21 sections, 12 theorems, 50 equations, 1 table.

Introduction
Our results
Massart halfspace model.
Massart generalized linear models.
Technical overview
Learning Massart halfspaces.
Learning Massart GLMs.
Related work
Concurrent and independent work.
Limitations and open problems
Preliminaries
Massart halfspaces
Separating hyperplanes for Massart halfspaces
Warmup: an "unbounded" separating hyperplane.
A "bounded" separating hyperplane for $\gamma$-margin Massart halfspaces.
...and 6 more sections

Key Result

Theorem 1

Let $D$ be an instance of the $\eta$-Massart halfspace model with margin $\gamma$, and let $\epsilon \in (0, 1)$. Then, $\mathsf{Perspectron}$ (alg:massart_margin) returns $\mathbf{w} \in \mathbb{B}^d$ such that $\ell_{\textup{0-1}}(\mathbf{w}) \le \eta + \epsilon$ with probability $0.99$,The formal

Theorems & Definitions (28)

Definition 1: Massart halfspace model
Theorem 1: Informal, see \ref{['theorem:halfspace-massart']}
Definition 2: Massart GLM, simplified
Remark 1
Theorem 2: Informal, see \ref{['theorem:glm-massart']}
Remark 2
Lemma 1: Separating hyperplane for Massart halfspaces
proof
Lemma 2: Bounded separating hyperplane for Massart halfspaces
proof
...and 18 more

Learning Noisy Halfspaces with a Margin: Massart is No Harder than Random

TL;DR

Abstract

Learning Noisy Halfspaces with a Margin: Massart is No Harder than Random

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (28)