Efficient PAC Learning of Halfspaces with Constant Malicious Noise Rate

Jie Shen

Efficient PAC Learning of Halfspaces with Constant Malicious Noise Rate

Jie Shen

TL;DR

This work tackles PAC learning of homogeneous halfspaces in the malicious-noise model, showing that a constant noise tolerance is achievable in polynomial time under concurrent large-margin and log-concave mixture assumptions. The method combines soft outlier removal via a linear-programming weight assignment with a reweighted hinge-loss minimization under a margin constraint, leveraging dense pancake properties to control gradient contributions. The results improve upon prior efficient algorithms by reaching $\eta = \Omega(1)$ under stated conditions, with rigorous deterministic and probabilistic analyses and explicit sample complexity. The approach offers a robust framework for adversarially corrupted data and suggests avenues for extensions to sparse models and potential linear-time algorithms, with broad implications for practical robust learning.

Abstract

Understanding noise tolerance of machine learning algorithms is a central quest in learning theory. In this work, we study the problem of computationally efficient PAC learning of halfspaces in the presence of malicious noise, where an adversary can corrupt both instances and labels of training samples. The best-known noise tolerance either depends on a target error rate under distributional assumptions or on a margin parameter under large-margin conditions. In this work, we show that when both types of conditions are satisfied, it is possible to achieve constant noise tolerance by minimizing a reweighted hinge loss. Our key ingredients include: 1) an efficient algorithm that finds weights to control the gradient deterioration from corrupted samples, and 2) a new analysis on the robustness of the hinge loss equipped with such weights.

Efficient PAC Learning of Halfspaces with Constant Malicious Noise Rate

TL;DR

under stated conditions, with rigorous deterministic and probabilistic analyses and explicit sample complexity. The approach offers a robust framework for adversarially corrupted data and suggests avenues for extensions to sparse models and potential linear-time algorithms, with broad implications for practical robust learning.

Abstract

Paper Structure (34 sections, 26 theorems, 95 equations, 1 table, 2 algorithms)

This paper contains 34 sections, 26 theorems, 95 equations, 1 table, 2 algorithms.

Introduction
Main results
Overview of our techniques
Related works
Roadmap
Preliminaries
Main Algorithm
The approach of talwar2020hinge
Our algorithm
Other potential approaches
Performance Guarantees
Deterministic results
Statistical results
Proof of Theorem \ref{['thm:main']}
Conclusion and Open Questions
...and 19 more sections

Key Result

Theorem 2

There exists an algorithm (Algorithm alg:main) satisfying the following. For any $\epsilon \in (0, \frac{2}{3}), \delta \in (0, 1)$, if Assumptions as:margin and as:log-concave are satisfied with $\gamma \geq \frac{16 \log(2/\epsilon)}{\sqrt{d}}$, $r \leq 2 \gamma$, $k \leq 64$, and if the malicious

Theorems & Definitions (51)

Definition 1: Learning with malicious noise
Theorem 2: Main result
Remark 3: Noise tolerance
Remark 4: Condition on $\gamma$
Theorem 5
Definition 6: Linear sum norm
Definition 7: Pancake
Lemma 8
Theorem 9: Main deterministic result
Lemma 10
...and 41 more

Efficient PAC Learning of Halfspaces with Constant Malicious Noise Rate

TL;DR

Abstract

Efficient PAC Learning of Halfspaces with Constant Malicious Noise Rate

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (51)