Explaining Robustness to Catastrophic Forgetting Through Incremental Concept Formation
Nicki Barari, Edward Kim, Christopher MacLellan
TL;DR
This paper investigates why Cobweb/4V, a hierarchical, concept-formation approach, resists catastrophic forgetting in continual visual learning. It tests three hypotheses—adaptive structure, sparse updates, and information-theoretic learning with sufficiency statistics—through controlled experiments on MNIST, Fashion-MNIST, MedMNIST, and CIFAR-10, comparing against neural baselines and CobwebNN. Results indicate that while adaptive restructuring and update sparsity can influence stability, the strongest evidence for reducing forgetting comes from Cobweb/4V’s closed-form, information-theoretic updates that do not require replay or revisiting past data. The findings suggest that concept-based, probabilistic representations with sufficiency-statistics updates offer a robust alternative to gradient-based continual-learning methods and point toward integrating these mechanisms into neural systems for scalable, stable learning.
Abstract
Catastrophic forgetting remains a central challenge in continual learning, where models are required to integrate new knowledge over time without losing what they have previously learned. In prior work, we introduced Cobweb/4V, a hierarchical concept formation model that exhibited robustness to catastrophic forgetting in visual domains. Motivated by this robustness, we examine three hypotheses regarding the factors that contribute to such stability: (1) adaptive structural reorganization enhances knowledge retention, (2) sparse and selective updates reduce interference, and (3) information-theoretic learning based on sufficiency statistics provides advantages over gradient-based backpropagation. To test these hypotheses, we compare Cobweb/4V with neural baselines, including CobwebNN, a neural implementation of the Cobweb framework introduced in this work. Experiments on datasets of varying complexity (MNIST, Fashion-MNIST, MedMNIST, and CIFAR-10) show that adaptive restructuring enhances learning plasticity, sparse updates help mitigate interference, and the information-theoretic learning process preserves prior knowledge without revisiting past data. Together, these findings provide insight into mechanisms that can mitigate catastrophic forgetting and highlight the potential of concept-based, information-theoretic approaches for building stable and adaptive continual learning systems.
