One-Time Soft Alignment Enables Resilient Learning without Weight Transport
Jeonghwan Cheon, Jaehyuk Bae, Se-Bum Paik
TL;DR
The paper addresses the inefficiency and biological implausibility of backpropagation by proposing a one-time soft alignment between forward and backward weights at initialization. This initial alignment enables deep networks to learn without weight transport, achieving performance close to standard backpropagation and surpassing traditional feedback alignment in stability and generalization. Through spectral analyses, the authors show that IFA guides the optimization toward smoother, flatter minima, which improves robustness to input corruptions and under adversarial perturbations. The approach offers a simple, hardware-friendly alternative with potential benefits for energy-efficient learning and neuromorphic implementations, while acknowledging remaining gaps for very deep or complex models.
Abstract
Backpropagation is the cornerstone of deep learning, but its reliance on symmetric weight transport and global synchronization makes it computationally expensive and biologically implausible. Feedback alignment offers a promising alternative by approximating error gradients through fixed random feedback, thereby avoiding symmetric weight transport. However, this approach often struggles with poor learning performance and instability, especially in deep networks. Here, we show that a one-time soft alignment between forward and feedback weights at initialization enables deep networks to achieve performance comparable to backpropagation, without requiring weight transport during learning. This simple initialization condition guides stable error minimization in the loss landscape, improving network trainability. Spectral analyses further reveal that initial alignment promotes smoother gradient flow and convergence to flatter minima, resulting in better generalization and robustness. Notably, we also find that allowing moderate deviations from exact weight symmetry can improve adversarial robustness compared to standard backpropagation. These findings demonstrate that a simple initialization strategy can enable effective learning in deep networks in a biologically plausible and resource-efficient manner.
