Reliable Learning of Halfspaces under Gaussian Marginals

Ilias Diakonikolas; Lisheng Ren; Nikos Zarifis

Reliable Learning of Halfspaces under Gaussian Marginals

Ilias Diakonikolas, Lisheng Ren, Nikos Zarifis

TL;DR

A new algorithm for reliable learning of Gaussian halfspaces on $\mathbb{R}^d$ with sample and computational complexity and a Statistical Query lower bound suggesting that the $d^Omega(\log (1/\alpha)$ dependence is best possible.

Abstract

We study the problem of PAC learning halfspaces in the reliable agnostic model of Kalai et al. (2012). The reliable PAC model captures learning scenarios where one type of error is costlier than the others. Our main positive result is a new algorithm for reliable learning of Gaussian halfspaces on $\mathbb{R}^d$ with sample and computational complexity $$d^{O(\log (\min\{1/α, 1/ε\}))}\min (2^{\log(1/ε)^{O(\log (1/α))}},2^{\mathrm{poly}(1/ε)})\;,$$ where $ε$ is the excess error and $α$ is the bias of the optimal halfspace. We complement our upper bound with a Statistical Query lower bound suggesting that the $d^{Ω(\log (1/α))}$ dependence is best possible. Conceptually, our results imply a strong computational separation between reliable agnostic learning and standard agnostic learning of halfspaces in the Gaussian setting.

Reliable Learning of Halfspaces under Gaussian Marginals

TL;DR

A new algorithm for reliable learning of Gaussian halfspaces on

with sample and computational complexity and a Statistical Query lower bound suggesting that the

dependence is best possible.

Abstract

with sample and computational complexity

where

is the excess error and

is the bias of the optimal halfspace. We complement our upper bound with a Statistical Query lower bound suggesting that the

dependence is best possible. Conceptually, our results imply a strong computational separation between reliable agnostic learning and standard agnostic learning of halfspaces in the Gaussian setting.

Reliable Learning of Halfspaces under Gaussian Marginals

TL;DR

Abstract

Reliable Learning of Halfspaces under Gaussian Marginals

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (48)