CODED-SMOOTHING: Coding Theory Helps Generalization
Parsa Moradi, Tayyebeh Jahaninezhad, Mohammad Ali Maddah-Ali
TL;DR
The paper addresses generalization and adversarial vulnerability by introducing the coded-smoothing module, a spline-based, coded-computing-inspired regularizer that encodes a batch of $K$ inputs into $N$ coded samples, processes them through a shared computation, and decodes to enforce local higher-order smoothness via a training objective that blends main and coded paths. It also proposes Randomized Coded Inference (RCI) to bolster robustness at inference by random batch permutations, incurring negligible overhead. Empirical results across supervised and unsupervised tasks, including WGAN-GP, demonstrate improved generalization and state-of-the-art adversarial robustness against gradient-based attacks with minimal computational cost. Overall, the approach provides a practical, scalable regularization framework that unifies supervised and unsupervised settings while delivering robustness benefits through randomized inference.
Abstract
We introduce the coded-smoothing module, which can be seamlessly integrated into standard training pipelines, both supervised and unsupervised, to regularize learning and improve generalization with minimal computational overhead. In addition, it can be incorporated into the inference pipeline to randomize the model and enhance robustness against adversarial perturbations. The design of coded-smoothing is inspired by general coded computing, a paradigm originally developed to mitigate straggler and adversarial failures in distributed computing by processing linear combinations of the data rather than the raw inputs. Building on this principle, we adapt coded computing to machine learning by designing an efficient and effective regularization mechanism that encourages smoother representations and more generalizable solutions. Extensive experiments on both supervised and unsupervised tasks demonstrate that coded-smoothing consistently improves generalization and achieves state-of-the-art robustness against gradient-based adversarial attacks.
