Table of Contents
Fetching ...

Testably Learning Polynomial Threshold Functions

Lucas Slot, Stefan Tiegel, Manuel Wiedmer

TL;DR

This work asks whether there is a price to pay for testably learning more complex concept classes, and shows that PTFs of arbitrary constant degree can be testably learned up to excess error $\varepsilon>0$ in time.

Abstract

Rubinfeld & Vasilyan recently introduced the framework of testable learning as an extension of the classical agnostic model. It relaxes distributional assumptions which are difficult to verify by conditions that can be checked efficiently by a tester. The tester has to accept whenever the data truly satisfies the original assumptions, and the learner has to succeed whenever the tester accepts. We focus on the setting where the tester has to accept standard Gaussian data. There, it is known that basic concept classes such as halfspaces can be learned testably with the same time complexity as in the (distribution-specific) agnostic model. In this work, we ask whether there is a price to pay for testably learning more complex concept classes. In particular, we consider polynomial threshold functions (PTFs), which naturally generalize halfspaces. We show that PTFs of arbitrary constant degree can be testably learned up to excess error $\varepsilon > 0$ in time $n^{\mathrm{poly}(1/\varepsilon)}$. This qualitatively matches the best known guarantees in the agnostic model. Our results build on a connection between testable learning and fooling. In particular, we show that distributions that approximately match at least $\mathrm{poly}(1/\varepsilon)$ moments of the standard Gaussian fool constant-degree PTFs (up to error $\varepsilon$). As a secondary result, we prove that a direct approach to show testable learning (without fooling), which was successfully used for halfspaces, cannot work for PTFs.

Testably Learning Polynomial Threshold Functions

TL;DR

This work asks whether there is a price to pay for testably learning more complex concept classes, and shows that PTFs of arbitrary constant degree can be testably learned up to excess error in time.

Abstract

Rubinfeld & Vasilyan recently introduced the framework of testable learning as an extension of the classical agnostic model. It relaxes distributional assumptions which are difficult to verify by conditions that can be checked efficiently by a tester. The tester has to accept whenever the data truly satisfies the original assumptions, and the learner has to succeed whenever the tester accepts. We focus on the setting where the tester has to accept standard Gaussian data. There, it is known that basic concept classes such as halfspaces can be learned testably with the same time complexity as in the (distribution-specific) agnostic model. In this work, we ask whether there is a price to pay for testably learning more complex concept classes. In particular, we consider polynomial threshold functions (PTFs), which naturally generalize halfspaces. We show that PTFs of arbitrary constant degree can be testably learned up to excess error in time . This qualitatively matches the best known guarantees in the agnostic model. Our results build on a connection between testable learning and fooling. In particular, we show that distributions that approximately match at least moments of the standard Gaussian fool constant-degree PTFs (up to error ). As a secondary result, we prove that a direct approach to show testable learning (without fooling), which was successfully used for halfspaces, cannot work for PTFs.
Paper Structure (26 sections, 44 theorems, 246 equations)

This paper contains 26 sections, 44 theorems, 246 equations.

Key Result

Theorem 3

Fix $d \in \mathbb{N}$. Then, for any $\varepsilon > 0$, the concept class of degree-$d$ polynomial threshold functions can be testably learned up to error $\varepsilon$ w.r.t. the standard Gaussian in time and sample complexity $n^{\mathrm{poly}(1/\varepsilon)}$.

Theorems & Definitions (78)

  • Definition 1: Agnostic learning with distributional assumptions
  • Definition 2: Testable learning RubinfeldVasilyan:testlearning
  • Theorem 3: Informal version of \ref{['THM:PTFPROOF:testablelearningofPTFs']}
  • Definition 4: Moment matching
  • Definition 5
  • Theorem 6: KKM:agnostichalfspaces
  • Theorem 7
  • Definition 8: Fooling
  • Theorem 9: GollakotaKlivansKothari:testlearningusingfooling
  • Remark 10
  • ...and 68 more