Improved Hardness Results for Learning Intersections of Halfspaces

Stefan Tiegel

Improved Hardness Results for Learning Intersections of Halfspaces

Stefan Tiegel

TL;DR

This work establishes strong hardness for weakly learning intersections of a small number of halfspaces in high dimensions. It derives statistically and computationally significant lower bounds under standard lattice hardness (GapSVP/SIVP) that rule out polynomial-time learning of ω(log log N) halfspaces, and provides unconditional SQ-hardness results for even constant numbers of halfspaces, requiring either extremely fine accuracy or exponentially many SQ queries. A core technical contribution is a novel reduction framework that links intersections of halfspaces to the parallel pancakes distribution, enabling both SQ lower bounds and a reduction-based hardness pathway via higher-degree polynomial threshold functions. The results substantially narrow the gap between known hardness for many halfspaces and the elusive two-halfspaces case, and they introduce techniques that may inform future unconditional and standard-assumption hardness results in high-dimensional learning. The work suggests that higher-degree PTF constructions can yield stronger lower bounds and motivates seeking hard instances beyond low-degree representations to further limit the feasibility of efficient learning.

Abstract

We show strong (and surprisingly simple) lower bounds for weakly learning intersections of halfspaces in the improper setting. Strikingly little is known about this problem. For instance, it is not even known if there is a polynomial-time algorithm for learning the intersection of only two halfspaces. On the other hand, lower bounds based on well-established assumptions (such as approximating worst-case lattice problems or variants of Feige's 3SAT hypothesis) are only known (or are implied by existing results) for the intersection of super-logarithmically many halfspaces [KS09,KS06,DSS16]. With intersections of fewer halfspaces being only ruled out under less standard assumptions [DV21] (such as the existence of local pseudo-random generators with large stretch). We significantly narrow this gap by showing that even learning $ω(\log \log N)$ halfspaces in dimension $N$ takes super-polynomial time under standard assumptions on worst-case lattice problems (namely that SVP and SIVP are hard to approximate within polynomial factors). Further, we give unconditional hardness results in the statistical query framework. Specifically, we show that for any $k$ (even constant), learning $k$ halfspaces in dimension $N$ requires accuracy $N^{-Ω(k)}$, or exponentially many queries -- in particular ruling out SQ algorithms with polynomial accuracy for $ω(1)$ halfspaces. To the best of our knowledge this is the first unconditional hardness result for learning a super-constant number of halfspaces. Our lower bounds are obtained in a unified way via a novel connection we make between intersections of halfspaces and the so-called parallel pancakes distribution [DKS17,BLPR19,BRST21] that has been at the heart of many lower bound constructions in (robust) high-dimensional statistics in the past few years.

Improved Hardness Results for Learning Intersections of Halfspaces

TL;DR

Abstract

halfspaces in dimension

takes super-polynomial time under standard assumptions on worst-case lattice problems (namely that SVP and SIVP are hard to approximate within polynomial factors). Further, we give unconditional hardness results in the statistical query framework. Specifically, we show that for any

(even constant), learning

halfspaces in dimension

requires accuracy

, or exponentially many queries -- in particular ruling out SQ algorithms with polynomial accuracy for

halfspaces. To the best of our knowledge this is the first unconditional hardness result for learning a super-constant number of halfspaces. Our lower bounds are obtained in a unified way via a novel connection we make between intersections of halfspaces and the so-called parallel pancakes distribution [DKS17,BLPR19,BRST21] that has been at the heart of many lower bound constructions in (robust) high-dimensional statistics in the past few years.

Paper Structure (24 sections, 7 theorems, 41 equations, 3 figures)

This paper contains 24 sections, 7 theorems, 41 equations, 3 figures.

Introduction
Previous hardness results
Hardness assumption, SQ model and main results
Future work
More on Previous Results
Technical Overview
Relation to parallel pancakes and SQ lower bound
Modifying the instance to obtain an intersection of degree-2 PTFs
Lower bound based on \ref{['assump:lattices']}
Comparison to klivans2009cryptographic
Preliminaries
Notation
Gaussian distributions
Various versions of Continuous LWE
Hermite polynomials and moment-matching distributions
...and 9 more sections

Key Result

Theorem 1.5

Let $N, k \in \varmathbb N$ such that $k \leqslant O(\sqrt{N})$. Under assump:lattices, there is no $T = N^{o(\tfrac{k}{\log k + \log\log N})}$-time algorithm using $O(T)$ samples that learns the intersection of $k$ halfspaces up to error better than $\tfrac{1}{2} - \tfrac{1}{\Omega(T)}$.

Figures (3)

Figure : (a)
Figure : (a)
Figure : (b)

Theorems & Definitions (16)

Definition 1.1
Theorem 1.5: See \ref{['thm:main_crypto_hardness']} for full version
Theorem 1.6
Definition 3.1: CLWE Distribution
Definition 3.2: Homogeneous CLWE (hCLWE) Distribution
Definition 3.3: Truncated hCLWE Distribution
Theorem 4.1
proof : Proof of \ref{['thm:main_crypto_hardness']}
proof : Proof of \ref{['fact:clwe_to_truncated_clwe']}
Theorem 5.1
...and 6 more

Improved Hardness Results for Learning Intersections of Halfspaces

TL;DR

Abstract

Improved Hardness Results for Learning Intersections of Halfspaces

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (16)