Self-Contrastive Forward-Forward Algorithm
Xing Chen, Dongshu Liu, Jeremie Laydevant, Julie Grollier
TL;DR
SCFF introduces Self-Contrastive Forward-Forward, a forward-only, local-learning algorithm that generates self-derived positive and negative inputs to optimize layer-wise goodness without backpropagation. It extends FF to unsupervised and sequential tasks, achieving state-of-the-art results among purely forward/local methods on MNIST, CIFAR-10, STL-10, and Tiny ImageNet, and demonstrates effective time-series learning with Bi-RNNs on FSDD. The method combines greedy or joint training, a sigmoid-based loss on layer goodness, and principled negative-sample design that positions negatives between positive clusters, enabling robust representation learning suitable for hardware-constrained environments. These results highlight SCFF’s potential for high-accuracy, real-time edge learning, and its relevance for neuromorphic computing where forward-only, local updates are advantageous.
Abstract
Agents that operate autonomously benefit from lifelong learning capabilities. However, compatible training algorithms must comply with the decentralized nature of these systems, which imposes constraints on both the parameter counts and the computational resources. The Forward-Forward (FF) algorithm is one of these. FF relies only on feedforward operations, the same used for inference, for optimizing layer-wise objectives. This purely forward approach eliminates the need for transpose operations required in traditional backpropagation. Despite its potential, FF has failed to reach state-of-the-art performance on most standard benchmark tasks, in part due to unreliable negative data generation methods for unsupervised learning. In this work, we propose the Self-Contrastive Forward-Forward (SCFF) algorithm, a competitive training method aimed at closing this performance gap. Inspired by standard self-supervised contrastive learning for vision tasks, SCFF generates positive and negative inputs applicable across various datasets. The method demonstrates superior performance compared to existing unsupervised local learning algorithms on several benchmark datasets, including MNIST, CIFAR-10, STL-10, and Tiny ImageNet. We extend FF's application to training recurrent neural networks, expanding its utility to sequential data tasks. These findings pave the way for high-accuracy, real-time learning on resource-constrained edge devices.
