Analysis of the Memorization and Generalization Capabilities of AI Agents: Are Continual Learners Robust?
Minsu Kim, Walid Saad
TL;DR
The paper tackles robustness in continual learning under non-stationary environments by jointly optimizing memorization and generalization using a capacity-limited memory $|\mathcal{M}_t|$. It introduces a probabilistic relaxation of worst-case risk across environments via the inverse-CDF $F_{\mathcal{R}}^{-1}(\alpha;\theta_t)$ and estimates the risk distribution from memory and current data. Theoretical results bound the memorization-generalization tradeoff as a function of memory size, showing that larger memory can hinder simultaneous optimality across all stored environments. Empirically, on rotated MNIST, the approach yields improved generalization to unseen rotations compared with memory-based baselines, supporting its potential for robust deployment in autonomous systems.
Abstract
In continual learning (CL), an AI agent (e.g., autonomous vehicles or robotics) learns from non-stationary data streams under dynamic environments. For the practical deployment of such applications, it is important to guarantee robustness to unseen environments while maintaining past experiences. In this paper, a novel CL framework is proposed to achieve robust generalization to dynamic environments while retaining past knowledge. The considered CL agent uses a capacity-limited memory to save previously observed environmental information to mitigate forgetting issues. Then, data points are sampled from the memory to estimate the distribution of risks over environmental change so as to obtain predictors that are robust with unseen changes. The generalization and memorization performance of the proposed framework are theoretically analyzed. This analysis showcases the tradeoff between memorization and generalization with the memory size. Experiments show that the proposed algorithm outperforms memory-based CL baselines across all environments while significantly improving the generalization performance on unseen target environments.
