Table of Contents
Fetching ...

Testing noisy low-degree polynomials for sparsity

Yiqiao Bao, Anindya De, Shivam Nadimpalli, Rocco A. Servedio, Nathan White

TL;DR

The paper studies the problem of testing whether an unknown degree-$d$ multilinear polynomial $p:\mathbb{R}^n\to\mathbb{R}$, given noisy evaluations on samples from a finitely supported product distribution $\boldsymbol{X}^{\otimes n}$, is $s$-sparse or far from $s$-sparse. It develops a sharp, computable threshold $\mathrm{MSG}_{\boldsymbol{X},d}(s)$ that separates constant-sample testability from requiring $\Omega(\log n)$ samples, and proves that a tester with constant sample complexity exists exactly when the target sparsity $T$ satisfies $T \ge \mathrm{MSG}_{\boldsymbol{X},d}(s)$. The analysis extends key anti-concentration and hypercontractivity techniques from the Boolean and Gaussian settings to general finitely supported product distributions, via a generalized DFKO framework and cumulant-based moment estimation to cope with noisy labels. This yields a complete, dimension-free characterization of sparsity testing for low-degree polynomials, including a computable algorithm to determine the gap threshold. The results advance understanding of when sparse low-degree polynomials can be efficiently tested from noisy samples, with implications for related learning and testing problems in high-dimensional statistics and computational learning theory.

Abstract

We consider the problem of testing whether an unknown low-degree polynomial $p$ over $\mathbb{R}^n$ is sparse versus far from sparse, given access to noisy evaluations of the polynomial $p$ at \emph{randomly chosen points}. This is a property-testing analogue of classical problems on learning sparse low-degree polynomials with noise, extending the work of Chen, De, and Servedio (2020) from noisy \emph{linear} functions to general low-degree polynomials. Our main result gives a \emph{precise characterization} of when sparsity testing for low-degree polynomials admits constant sample complexity independent of dimension, together with a matching constant-sample algorithm in that regime. For any mean-zero, variance-one finitely supported distribution $\boldsymbol{X}$ over the reals, degree $d$, and any sparsity parameters $s \leq T$, we define a computable function $\mathrm{MSG}_{\boldsymbol{X},d}(\cdot)$, and: - For $T \ge \mathrm{MSG}_{\boldsymbol{X},d}(s)$, we give an $O_{s,\boldsymbol{X},d}(1)$-sample algorithm that distinguishes whether a multilinear degree-$d$ polynomial over $\mathbb{R}^n$ is $s$-sparse versus $\varepsilon$-far from $T$-sparse, given examples $(\boldsymbol{x},\, p(\boldsymbol{x}) + \mathrm{noise})_{\boldsymbol{x} \sim \boldsymbol{X}^{\otimes n}}$. Crucially, the sample complexity is \emph{completely independent} of the ambient dimension $n$. - For $T \leq \mathrm{MSG}_{\boldsymbol{X},d}(s) - 1$, we show that even without noise, any algorithm given samples $(\boldsymbol{x},p(\boldsymbol{x}))_{\boldsymbol{x} \sim \boldsymbol{X}^{\otimes n}}$ must use $Ω_{\boldsymbol{X},d,s}(\log n)$ examples. Our techniques employ a generalization of the results of Dinur et al. (2007) on the Fourier tails of bounded functions over $\{0,1\}^n$ to a broad range of finitely supported distributions, which may be of independent interest.

Testing noisy low-degree polynomials for sparsity

TL;DR

The paper studies the problem of testing whether an unknown degree- multilinear polynomial , given noisy evaluations on samples from a finitely supported product distribution , is -sparse or far from -sparse. It develops a sharp, computable threshold that separates constant-sample testability from requiring samples, and proves that a tester with constant sample complexity exists exactly when the target sparsity satisfies . The analysis extends key anti-concentration and hypercontractivity techniques from the Boolean and Gaussian settings to general finitely supported product distributions, via a generalized DFKO framework and cumulant-based moment estimation to cope with noisy labels. This yields a complete, dimension-free characterization of sparsity testing for low-degree polynomials, including a computable algorithm to determine the gap threshold. The results advance understanding of when sparse low-degree polynomials can be efficiently tested from noisy samples, with implications for related learning and testing problems in high-dimensional statistics and computational learning theory.

Abstract

We consider the problem of testing whether an unknown low-degree polynomial over is sparse versus far from sparse, given access to noisy evaluations of the polynomial at \emph{randomly chosen points}. This is a property-testing analogue of classical problems on learning sparse low-degree polynomials with noise, extending the work of Chen, De, and Servedio (2020) from noisy \emph{linear} functions to general low-degree polynomials. Our main result gives a \emph{precise characterization} of when sparsity testing for low-degree polynomials admits constant sample complexity independent of dimension, together with a matching constant-sample algorithm in that regime. For any mean-zero, variance-one finitely supported distribution over the reals, degree , and any sparsity parameters , we define a computable function , and: - For , we give an -sample algorithm that distinguishes whether a multilinear degree- polynomial over is -sparse versus -far from -sparse, given examples . Crucially, the sample complexity is \emph{completely independent} of the ambient dimension . - For , we show that even without noise, any algorithm given samples must use examples. Our techniques employ a generalization of the results of Dinur et al. (2007) on the Fourier tails of bounded functions over to a broad range of finitely supported distributions, which may be of independent interest.

Paper Structure

This paper contains 61 sections, 46 theorems, 192 equations, 1 figure, 5 algorithms.

Key Result

Theorem 2

Fix any finitely supported random variable $\boldsymbol{X}$ over $\mathbb{R}$ which has $\operatorname{{\bf E}}[\boldsymbol{X}]=0$ and $\operatorname{{\bf Var}}[\boldsymbol{X}]=1$. Let $\mathcal{D}$ be the i.i.d. product distribution $\mathcal{D} \equiv \boldsymbol{X}^{\otimes n}$. Let $\boldsymbol{ Then there is an algorithm for the $(\boldsymbol{X},d,\boldsymbol{\eta},s,T,\varepsilon)$ polynomia

Figures (1)

  • Figure 1: Schematic diagram of the sets of polynomials we are concerned with.

Theorems & Definitions (99)

  • Definition 1
  • Definition 2: Sparsity testing problem with gap parameter $T$
  • Theorem 2: The first algorithm: A coarse but explicit upper bound
  • Definition 3
  • Theorem 3: $\MSG_{\bX,d}(s)$ is well-defined
  • Theorem 3: Algorithm for computing $\MSG_{\bX,d}(s)$
  • Theorem 3: Lower bound on the polynomial sparsity testing problem
  • Theorem 3: Upper bound on the polynomial sparsity testing problem
  • Lemma 4
  • Theorem 5: Theorem 10.21 of odonnell-book
  • ...and 89 more