Understanding Catastrophic Interference: On the Identifibility of Latent Representations
Yuke Li, Yujia Zheng, Tianyi Xiong, Zhenyi Wang, Heng Huang
TL;DR
The paper tackles catastrophic forgetting by treating it as a latent-variable identifiability problem between partial-task aware and all-task aware data-generating processes. It proves that forgetting can be quantified by the distance between PTA and ATA latent representations and mitigated by identifying and aligning their shared latent variables through a two-stage training procedure (MLE latent recovery followed by KL-based alignment). The ICON framework leverages flow-based networks to model PTA and ATA, provides identifiability guarantees under stated assumptions, and demonstrates superior performance on synthetic data and real-world benchmarks (CIFAR-100 and ImageNet-100) compared with state-of-the-art methods. This work bridges theory and practice, offering a principled approach to continual learning through latent-variable identifiability with practical strategies for real-world continual adaptation.
Abstract
Catastrophic interference, also known as catastrophic forgetting, is a fundamental challenge in machine learning, where a trained learning model progressively loses performance on previously learned tasks when adapting to new ones. In this paper, we aim to better understand and model the catastrophic interference problem from a latent representation learning point of view, and propose a novel theoretical framework that formulates catastrophic interference as an identification problem. Our analysis demonstrates that the forgetting phenomenon can be quantified by the distance between partial-task aware (PTA) and all-task aware (ATA) setups. Building upon recent advances in identifiability theory, we prove that this distance can be minimized through identification of shared latent variables between these setups. When learning, we propose our method \ourmeos with two-stage training strategy: First, we employ maximum likelihood estimation to learn the latent representations from both PTA and ATA configurations. Subsequently, we optimize the KL divergence to identify and learn the shared latent variables. Through theoretical guarantee and empirical validations, we establish that identifying and learning these shared representations can effectively mitigate catastrophic interference in machine learning systems. Our approach provides both theoretical guarantees and practical performance improvements across both synthetic and benchmark datasets.
