Implicit Regularization in Feedback Alignment Learning Mechanisms for Neural Networks
Zachary Robertson, Oluwasanmi Koyejo
TL;DR
The paper addresses the gap in understanding how Feedback Alignment (FA) can learn effectively while remaining biologically plausible. It introduces a modular framework that decomposes learning Dynamics into alignment, gradient, and FA components, and proves a conservation law that enforces alignment at the neuron level, enabling an alignment-dominance-based convergence analysis. Theoretical results show that, under certain data separability conditions and initialization schemes, FA (including sign-FA and adaFA) converges, with explicit exponential or generalized decay of the loss, and that alignment quality correlates with improved multi-class performance. Empirically, the framework is validated on MNIST, CIFAR-100, and Tiny-ImageNet, demonstrating alignment conservation, enhanced performance with better alignment, and insights into benign overfitting in FA-based learning. These findings advance interpretability of bio-plausible learning and provide practical guidance for designing FA-based algorithms with improved convergence and generalization in complex tasks.
Abstract
Feedback Alignment (FA) methods are biologically inspired local learning rules for training neural networks with reduced communication between layers. While FA has potential applications in distributed and privacy-aware ML, limitations in multi-class classification and lack of theoretical understanding of the alignment mechanism have constrained its impact. This study introduces a unified framework elucidating the operational principles behind alignment in FA. Our key contributions include: (1) a novel conservation law linking changes in synaptic weights to implicit regularization that maintains alignment with the gradient, with support from experiments, (2) sufficient conditions for convergence based on the concept of alignment dominance, and (3) empirical analysis showing better alignment can enhance FA performance on complex multi-class tasks. Overall, these theoretical and practical advancements improve interpretability of bio-plausible learning rules and provide groundwork for developing enhanced FA algorithms.
