Table of Contents
Fetching ...

Parabolic Continual Learning

Haoming Yang, Ali Hasan, Vahid Tarokh

TL;DR

The work addresses continual learning under evolving data by enforcing a parabolic PDE constraint on the expected loss profile $u(x,t)$ with a memory-buffer boundary. The proposed Parabolic Continual Learner (PCL) combines diffusion-driven interior dynamics $\partial_t u = \sigma \nabla^2 u + \ell_{f_\theta}(x)$ (with optional first-order drift) and a boundary condition $u|_{\mathcal{M}^{(\epsilon)}} = \ell_{f_\theta}$, enabling tractable bounds on forgetting and generalization via the Feynman-Kac representation for scalable computation. Empirically, PCL achieves competitive or superior final accuracy and robustness across online class-incremental benchmarks, corrupted labels, and imbalanced data, while offering favorable efficiency compared to some replay-based baselines. The framework provides a unified PDE-based perspective that links regularization and memory replay through boundary-driven loss evolution, with potential extensions to broader modalities and control-theoretic connections.

Abstract

Regularizing continual learning techniques is important for anticipating algorithmic behavior under new realizations of data. We introduce a new approach to continual learning by imposing the properties of a parabolic partial differential equation (PDE) to regularize the expected behavior of the loss over time. This class of parabolic PDEs has a number of favorable properties that allow us to analyze the error incurred through forgetting and the error induced through generalization. Specifically, we do this through imposing boundary conditions where the boundary is given by a memory buffer. By using the memory buffer as a boundary, we can enforce long term dependencies by bounding the expected error by the boundary loss. Finally, we illustrate the empirical performance of the method on a series of continual learning tasks.

Parabolic Continual Learning

TL;DR

The work addresses continual learning under evolving data by enforcing a parabolic PDE constraint on the expected loss profile with a memory-buffer boundary. The proposed Parabolic Continual Learner (PCL) combines diffusion-driven interior dynamics (with optional first-order drift) and a boundary condition , enabling tractable bounds on forgetting and generalization via the Feynman-Kac representation for scalable computation. Empirically, PCL achieves competitive or superior final accuracy and robustness across online class-incremental benchmarks, corrupted labels, and imbalanced data, while offering favorable efficiency compared to some replay-based baselines. The framework provides a unified PDE-based perspective that links regularization and memory replay through boundary-driven loss evolution, with potential extensions to broader modalities and control-theoretic connections.

Abstract

Regularizing continual learning techniques is important for anticipating algorithmic behavior under new realizations of data. We introduce a new approach to continual learning by imposing the properties of a parabolic partial differential equation (PDE) to regularize the expected behavior of the loss over time. This class of parabolic PDEs has a number of favorable properties that allow us to analyze the error incurred through forgetting and the error induced through generalization. Specifically, we do this through imposing boundary conditions where the boundary is given by a memory buffer. By using the memory buffer as a boundary, we can enforce long term dependencies by bounding the expected error by the boundary loss. Finally, we illustrate the empirical performance of the method on a series of continual learning tasks.

Paper Structure

This paper contains 35 sections, 2 theorems, 26 equations, 2 figures, 10 tables, 2 algorithms.

Key Result

Proposition 1

Consider $u(x_{t-\tau},t-\tau)$ for $x_{t-\tau} \in \mathcal{C}_{\mathcal{M}_t}$ and $\tau \geq 0$. Additionally suppose that the learning landscape satisfies the following parabolic PDE Then the following inequality holds:

Figures (2)

  • Figure 1: Properties of parabolic continual learner. At time $s$, samples in the buffer $\mathcal{M}_s$ are evaluated to construct the boundary of expected loss profile $u(x,s)$. For $t>s$ a new task is introduced, PCL enforces the expected loss $u$ to satisfy the PDE in \ref{['eq:pde']} with boundary condition in \ref{['eq:bc']}. As long as $u(x,t)$ continues to satisfy the PDE, the loss profile of PCL can be anticipated for $t > s$.
  • Figure 2: Memory buffer $\mathcal{M}_t$ changes for different time steps and dictates the boundary on which the solution is found.

Theorems & Definitions (6)

  • Proposition 1: Upper bound on expected forgetting error
  • Corollary 1: Lower bound on expected generalization error
  • Remark 1: An Upper Bound on the Negative Log Loss
  • proof
  • proof
  • proof