Tuned Compositional Feature Replays for Efficient Stream Learning

Morgan B. Talbot; Rushikesh Zawar; Rohil Badkundri; Mengmi Zhang; Gabriel Kreiman

Tuned Compositional Feature Replays for Efficient Stream Learning

Morgan B. Talbot, Rushikesh Zawar, Rohil Badkundri, Mengmi Zhang, Gabriel Kreiman

TL;DR

The paper tackles online stream learning, where models must continually learn from temporally coherent, non-repeating data without revisiting past samples. It introduces CRUMB, a differentiable codebook of memory blocks that compositionally reconstructs feature maps for memory-efficient replay, enabling performance close to offline upper bounds with only $3.6\%$ of the memory footprint of raw-image replay. CRUMB's pretraining induces a shape bias that stabilizes learning and reduces forgetting, and its replay operates at the feature level, yielding significant memory and runtime savings across seven continual-learning benchmarks and two newly adapted stream-learning datasets. The approach outperforms state-of-the-art baselines in most class-i.i.d. and class-instance settings, offers strong scalability to large datasets, and is adaptable across CNN architectures, making it well-suited for edge devices and robotic learning scenarios minus substantial data-storage overhead.

Abstract

Our brains extract durable, generalizable knowledge from transient experiences of the world. Artificial neural networks come nowhere close to this ability. When tasked with learning to classify objects by training on non-repeating video frames in temporal order (online stream learning), models that learn well from shuffled datasets catastrophically forget old knowledge upon learning new stimuli. We propose a new continual learning algorithm, Compositional Replay Using Memory Blocks (CRUMB), which mitigates forgetting by replaying feature maps reconstructed by combining generic parts. CRUMB concatenates trainable and re-usable "memory block" vectors to compositionally reconstruct feature map tensors in convolutional neural networks. Storing the indices of memory blocks used to reconstruct new stimuli enables memories of the stimuli to be replayed during later tasks. This reconstruction mechanism also primes the neural network to minimize catastrophic forgetting by biasing it towards attending to information about object shapes more than information about image textures, and stabilizes the network during stream learning by providing a shared feature-level basis for all training examples. These properties allow CRUMB to outperform an otherwise identical algorithm that stores and replays raw images, while occupying only 3.6% as much memory. We stress-tested CRUMB alongside 13 competing methods on 7 challenging datasets. To address the limited number of existing online stream learning datasets, we introduce 2 new benchmarks by adapting existing datasets for stream learning. With only 3.7-4.1% as much memory and 15-43% as much runtime, CRUMB mitigates catastrophic forgetting more effectively than the state-of-the-art. Our code is available at https://github.com/MorganBDT/crumb.git.

Tuned Compositional Feature Replays for Efficient Stream Learning

TL;DR

Abstract

Tuned Compositional Feature Replays for Efficient Stream Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)