Deep Generative Continual Learning using Functional LoRA: FunLoRA
Victor Enescu, Hichem Sahbi
TL;DR
This work tackles catastrophic forgetting in continual learning for deep generative models by introducing FunLoRA, a parameter-efficient conditioning mechanism that uses rank-1 LoRA updates whose effective rank is increased via functional transforms such as cosine, rshift, and powers applied to convolutional layers within a conditional flow-matching backbone. The approach trains only current-task parameters and uses a layer-importance criterion to select a small subset of the U-Net backbone for adaptation, avoiding retraining on past data and reducing memory and compute. The paper formalizes conditional flow matching with $L_{CFM}$ and defines FunLoRA as $\mathbf{F}_{\mathbf{y}} = \frac{1}{p} \sum_{i=1}^{p} \alpha_i f_i(\mathbf{A}_{\mathbf{y}}, \mathbf{B}_{\mathbf{y}})$, enabling expressive class-conditioned conditioning while remaining parameter-efficient. Empirical results on CIFAR-100 and ImageNet-100 show that FunLoRA outperforms diffusion-based incremental methods and even some pre-trained stable-diffusion baselines, with substantially fewer extra parameters and faster sampling, and gains are further amplified by COCO pretraining.
Abstract
Continual adaptation of deep generative models holds tremendous potential and critical importance, given their rapid and expanding usage in text and vision based applications. Incremental training, however, remains highly challenging due to catastrophic forgetting phenomenon, which makes it difficult for neural networks to effectively incorporate new knowledge. A common strategy consists in retraining the generative model on its own synthetic data in order to mitigate forgetting. Yet, such an approach faces two major limitations: (i) the continually increasing training time eventually becomes intractable, and (ii) reliance on synthetic data inevitably leads to long-term performance degradation, since synthetic samples lack the richness of real training data. In this paper, we attenuate these issues by designing a novel and more expressive conditioning mechanism for generative models based on low rank adaptation (LoRA), that exclusively employs rank 1 matrices, whose reparametrized matrix rank is functionally increased using carefully selected functions -- and dubbed functional LoRA: FunLoRA. Using this dynamic conditioning, the generative model is guaranteed to avoid catastrophic forgetting and needs only to be trained on data from the current task. Extensive experiments using flow-matching based models trained from scratch, showcase that our proposed parameter-efficient fine-tuning (PEFT) method surpasses prior state-of-the-art results based on diffusion models, reaching higher classification accuracy scores, while only requiring a fraction of the memory cost and sampling time.
