Symmetric Distributions from Shallow Circuits
Daniel M. Kane, Anthony Ostuni, Kewen Wu
TL;DR
The paper addresses which symmetric distributions over $\{0,1\\}^n$ can be generated by shallow circuits, focusing on $d$-local Boolean functions. It proves that if a $d$-local function $f$ applied to uniform inputs yields a distribution close to a symmetric target, then the target is well-approximated by a mixture of the uniform distributions over even and odd Hamming weights and $\\gamma$-biased product distributions with $\\gamma$ a dyadic multiple of $2^{-d}$. The mixing weights are shown to be governed by low-degree $\\f{F}_2$-polynomials, and the mixture can be realized by $O_d(1)$-local constructions, generalizing prior uniform-symmetric classifications. The authors develop a four-step framework—removing large influences, Kolmogorov-distance analysis, approximate continuity, and composition—to connect local output structure to global weight distributions and assemble the final characterization. They also discuss learning implications for locally-sampleable symmetric distributions and outline open problems on exact characterization and broader biases, offering a roadmap for exact classification conjectures and potential extensions.
Abstract
We characterize the symmetric distributions that can be (approximately) generated by shallow Boolean circuits. More precisely, let $f\colon \{0,1\}^m \to \{0,1\}^n$ be a Boolean function where each output bit depends on at most $d$ input bits. Suppose the output distribution of $f$ evaluated on uniformly random input bits is close in total variation distance to a symmetric distribution $\mathcal{D}$ over $\{0,1\}^n$. Then $\mathcal{D}$ must be close to a mixture of the uniform distribution over $n$-bit strings of even Hamming weight, the uniform distribution over $n$-bit strings of odd Hamming weight, and $γ$-biased product distributions for $γ$ an integer multiple of $2^{-d}$. Moreover, the mixing weights are determined by low-degree, sparse $\mathbb{F}_2$-polynomials. This extends the previous classification for generating symmetric distributions that are also uniform over their support.
