Mono-Forward: Backpropagation-Free Algorithm for Efficient Neural Network Training Harnessing Local Errors
James Gong, Bruce Li, Waleed Abdulla
TL;DR
The paper addresses the inefficiency and biological implausibility of backpropagation by introducing Mono-Forward (MF), a purely local, layerwise greedy training algorithm inspired by Forward-Forward. MF uses per-layer projection matrices to compute layerwise goodness $G_i=a_i M_i^\top$ and trains via cross-entropy on these scores, enabling a single forward Pass for both training and prediction with explicit label–input connections. Experimental results on MLPs and CNNs across MNIST, Fashion-MNIST, CIFAR-10, and CIFAR-100 show MF matching or surpassing BP accuracy while achieving substantially lower, more stable memory usage and better parallelizability; MF’s depth-insensitive convergence and one-pass prediction (with BP-pred option) further enhance practicality. The findings suggest MF as a scalable, modular, and more biologically plausible alternative to BP, offering improved memory efficiency and hardware-friendly parallelism without sacrificing performance.
Abstract
Backpropagation is the standard method for achieving state-of-the-art accuracy in neural network training, but it often imposes high memory costs and lacks biological plausibility. In this paper, we introduce the Mono-Forward algorithm, a purely local layerwise learning method inspired by Hinton's Forward-Forward framework. Unlike backpropagation, Mono-Forward optimizes each layer solely with locally available information, eliminating the reliance on global error signals. We evaluated Mono-Forward on multi-layer perceptrons and convolutional neural networks across multiple benchmarks, including MNIST, Fashion-MNIST, CIFAR-10, and CIFAR-100. The test results show that Mono-Forward consistently matches or surpasses the accuracy of backpropagation across all tasks, with significantly reduced and more even memory usage, better parallelizability, and a comparable convergence rate.
