DNF Learning via Locally Mixing Random Walks
Josh Alman, Shivam Nadimpalli, Shyamal Patel, Rocco A. Servedio
TL;DR
This work advances distribution-free PAC learning of DNFs with membership queries by introducing a novel Locally Mixing Random Walks framework and two enabling results. First, it achieves quasi-polynomial time list-decoding for a single term of an unknown $s$-term DNF under an arbitrary distribution, producing a candidate list guaranteed to contain a true term with high probability. Second, it yields a quasipolynomial-time distribution-free PAC+MQ learner for the subclass of size-$s$ DNFs with all terms of the same size (exact-$k$ DNFs), leveraging a pruning-and-expansion strategy that iteratively builds a compact, correct hypothesis. The key technical engine is a local-mixing analysis on graphs covered by expanders, which enables sampling near-uniformly from promising term-sets and supports the far-point-based learning augmentation. Together, these results push toward efficient learning of rich DNF classes in the distribution-free setting and showcase the power of random-walk and expander techniques in algorithmic learning.
Abstract
We give two results on PAC learning DNF formulas using membership queries in the challenging "distribution-free" learning framework, where learning algorithms must succeed for an arbitrary and unknown distribution over $\{0,1\}^n$. (1) We first give a quasi-polynomial time "list-decoding" algorithm for learning a single term of an unknown DNF formula. More precisely, for any target $s$-term DNF formula $f = T_1 \vee \cdots \vee T_s$ over $\{0,1\}^n$ and any unknown distribution $D$ over $\{0,1\}^n$, our algorithm, which uses membership queries and random examples from $D$, runs in $\textsf{quasipoly}(n,s)$ time and outputs a list $L$ of candidate terms such that with high probability some term $T_i$ of $f$ belongs to $L$. (2) We then use result (1) to give a $\textsf{quasipoly}(n,s)$-time algorithm, in the distribution-free PAC learning model with membership queries, for learning the class of size-$s$ DNFs in which all terms have the same size. Our algorithm learns using a DNF hypothesis. The key tool used to establish result (1) is a new result on "locally mixing random walks," which, roughly speaking, shows that a random walk on a graph that is covered by a small number of expanders has a non-negligible probability of mixing quickly in a subset of these expanders.
