Hardness of Learning Neural Networks under the Manifold Hypothesis
Bobak T. Kiani, Jason Wang, Melanie Weber
TL;DR
This work analyzes the impact of the manifold geometry on the learnability of neural networks. It shows that data lying on bounded-curvature manifolds can be provably hard to learn in the statistical-query framework, while introducing volume-based assumptions yields a simple interpolation-based learnability proof for efficiently sampleable manifolds, including those reconstructible by manifold learning. The authors provide cryptographic hardness results to reinforce the hardness in difficult geometric regimes and complement theory with experiments that validate learnability in the favorable regime and hardness in the challenging one, along with an empirical study of intrinsic dimensionality in real image datasets. Together, these results delineate when geometry helps or hinders learnability and suggest that real-world data likely reside in intermediate, heterogeneous geometric regimes that demand more nuanced algorithms or architectures.
Abstract
The manifold hypothesis presumes that high-dimensional data lies on or near a low-dimensional manifold. While the utility of encoding geometric structure has been demonstrated empirically, rigorous analysis of its impact on the learnability of neural networks is largely missing. Several recent results have established hardness results for learning feedforward and equivariant neural networks under i.i.d. Gaussian or uniform Boolean data distributions. In this paper, we investigate the hardness of learning under the manifold hypothesis. We ask which minimal assumptions on the curvature and regularity of the manifold, if any, render the learning problem efficiently learnable. We prove that learning is hard under input manifolds of bounded curvature by extending proofs of hardness in the SQ and cryptographic settings for Boolean data inputs to the geometric setting. On the other hand, we show that additional assumptions on the volume of the data manifold alleviate these fundamental limitations and guarantee learnability via a simple interpolation argument. Notable instances of this regime are manifolds which can be reliably reconstructed via manifold learning. Looking forward, we comment on and empirically explore intermediate regimes of manifolds, which have heterogeneous features commonly found in real world data.
