Simplicial covering dimension of extremal concept classes
Ari Blondal, Hamed Hatami, Pooya Hatami, Chavdar Lalov, Sivan Tretiak
TL;DR
We introduce the simplicial covering dimension $\mathop{\mathrm{sc}}(\mathcal{C})$ for binary concept classes, defined via the topology of the space of realizable distributions and refined by zero-loss sets. In the finite-domain setting, we prove $\mathop{\mathrm{lr}}(\mathcal{C}) = \mathop{\mathrm{sc}}(\mathcal{C}) + 1$, linking a topological invariant to list replicability. We classify the exact $\mathop{\mathrm{sc}}$-values for extremal classes: if $\mathcal{E} \neq \{\pm 1\}^{\mathcal{X}}$, then $\mathop{\mathrm{sc}}(\mathcal{E}) = \mathop{\mathrm{vc}}(\mathcal{E})$ and $\mathop{\mathrm{lr}}(\mathcal{E}) = \mathop{\mathrm{vc}}(\mathcal{E}) + 1$; for the full binary cube, $\mathop{\mathrm{sc}} = |\mathcal{X}| - 1$ and $\mathop{\mathrm{lr}} = |\mathcal{X}|$. The analysis leverages a deformation retraction between the Δ-structure of realizable distributions and the cubical complex of strongly shattered sets, together with Lebesgue’s covering theory, to derive tight bounds. The results unify and extend known values for several natural classes via a topological framework, and open questions remain about VC-based bounds and dual-VC interactions with $\mathop{\mathrm{sc}}(\cdot)$.
Abstract
Dimension theory is a branch of topology concerned with defining and analyzing dimensions of geometric and topological spaces in purely topological terms. In this work, we adapt the classical notion of topological dimension (Lebesgue covering) to binary concept classes. The topological space naturally associated with a concept class is its space of realizable distributions. The loss function and the class itself induce a simplicial structure on this space, with respect to which we define a simplicial covering dimension. We prove that for finite concept classes, this simplicial covering dimension exactly characterizes the list replicability number (equivalently, global stability) in PAC learning. This connection allows us to apply tools from classical dimension theory to compute the exact list replicability number of the broad family of extremal concept classes.
