A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks
Erik B. Terres-Escudero, Javier Del Ser, Pablo Garcia Bringas
TL;DR
This paper introduces the Symmetric Forward-Forward Algorithm (SFFA), a symmetric extension of the Forward-Forward Algorithm (FFA) designed to address asymmetries in FFA's loss landscape and improve generalization for continual learning. By partitioning each layer into positive and negative neuron groups and defining a balanced probability-based goodness, SFFA achieves more stable, layer-wise updates and naturally supports sparse latent representations favorable for CL. The authors demonstrate that SFFA improves generalization over FFA, achieves competitive performance with backpropagation on several image datasets, and can be integrated with standard CL methods across Class IL, Domain IL, and Task IL, albeit with limitations for Task IL due to weight-sharing constraints. The work identifies a promising direction for forward-only learning in continual learning settings and outlines future avenues, including architectural adaptations and convolutional enhancements, to broaden applicability and performance.
Abstract
The so-called Forward-Forward Algorithm (FFA) has recently gained momentum as an alternative to the conventional back-propagation algorithm for neural network learning, yielding competitive performance across various modeling tasks. By replacing the backward pass of gradient back-propagation with two contrastive forward passes, the FFA avoids several shortcomings undergone by its predecessor (e.g., vanishing/exploding gradient) by enabling layer-wise training heuristics. In classification tasks, this contrastive method has been proven to effectively create a latent sparse representation of the input data, ultimately favoring discriminability. However, FFA exhibits an inherent asymmetric gradient behavior due to an imbalanced loss function between positive and negative data, adversely impacting on the model's generalization capabilities and leading to an accuracy degradation. To address this issue, this work proposes the Symmetric Forward-Forward Algorithm (SFFA), a novel modification of the original FFA which partitions each layer into positive and negative neurons. This allows the local fitness function to be defined as the ratio between the activation of positive neurons and the overall layer activity, resulting in a symmetric loss landscape during the training phase. To evaluate the enhanced convergence of our method, we conduct several experiments using multiple image classification benchmarks, comparing the accuracy of models trained with SFFA to those trained with its FFA counterpart. As a byproduct of this reformulation, we explore the advantages of using a layer-wise training algorithm for Continual Learning (CL) tasks. The specialization of neurons and the sparsity of their activations induced by layer-wise training algorithms enable efficient CL strategies that incorporate new knowledge (classes) into the neural network, while preventing catastrophic forgetting of previously...
