GPS: Distilling Compact Memories via Grid-based Patch Sampling for Efficient Online Class-Incremental Learning
Mingchuan Ma, Yuhao Zhou, Jindi Lv, Yuxin Tian, Dan Si, Shujian Li, Qing Ye, Jiancheng Lv
TL;DR
The paper tackles online class-incremental learning under strict memory and access constraints by addressing the bottleneck of informative memory distillation. It introduces Grid-based Patch Sampling (GPS), a model-free, lightweight method that converts high-resolution images into compact, structure-preserving surrogates by sampling one pixel per grid patch, enabling many more memory exemplars under the same budget. GPS supports two replay pathways—concatenation-based reconstruction for training and NCM-based upsampling for inference—without requiring bi-level optimization or a converged backbone. Empirical results across CIFAR-100, Mini-ImageNet, and Tiny-ImageNet show 3–4% gains in average end accuracy over strong replay baselines, with minimal computational overhead, demonstrating GPS as a plug-and-play, scalable solution for memory-efficient online continual learning.
Abstract
Online class-incremental learning aims to enable models to continuously adapt to new classes with limited access to past data, while mitigating catastrophic forgetting. Replay-based methods address this by maintaining a small memory buffer of previous samples, achieving competitive performance. For effective replay under constrained storage, recent approaches leverage distilled data to enhance the informativeness of memory. However, such approaches often involve significant computational overhead due to the use of bi-level optimization. Motivated by these limitations, we introduce Grid-based Patch Sampling (GPS), a lightweight and effective strategy for distilling informative memory samples without relying on a trainable model. GPS generates informative samples by sampling a subset of pixels from the original image, yielding compact low-resolution representations that preserve both semantic content and structural information. During replay, these representations are reassembled to support training and evaluation. Experiments on extensive benchmarks demonstrate that GRS can be seamlessly integrated into existing replay frameworks, leading to 3%-4% improvements in average end accuracy under memory-constrained settings, with limited computational overhead.
