Cellular Learning: Scattered Data Regression in High Dimensions via Voronoi Cells
Shankar Prasad Sastry
TL;DR
The paper tackles regression of scattered data in high dimensions by introducing cellular learning, a method that blends per-cell linear functions centered at seed vertices to produce a continuous, piecewise-smooth regression without explicit Voronoi diagram computation. It leverages Lloyd's algorithm for seed initialization and Adam optimization to fit parameters, with regularization on coefficients and blending to improve generalization. Empirical results on MNIST with one-vs-rest classification achieve up to 98.20% accuracy using 46 cells and 722k parameters, demonstrating scalability and competitive performance without augmentation. The work offers a scalable, interpretable alternative to nonlinear models and outlines avenues for hierarchical and hyperspherical extensions to further improve efficiency and expressiveness.
Abstract
I present a regression algorithm that provides a continuous, piecewise-smooth function approximating scattered data. It is based on composing and blending linear functions over Voronoi cells, and it scales to high dimensions. The algorithm infers Voronoi cells from seed vertices and constructs a linear function for the input data in and around each cell. As the algorithm does not explicitly compute the Voronoi diagram, it avoids the curse of dimensionality. An accuracy of around 98.2% on the MNIST dataset with 722,200 degrees of freedom (without data augmentation, convolution, or other geometric operators) demonstrates the applicability and scalability of the algorithm.
