FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability-Plasticity Tradeoff
Isaac Han, Sangyeon Park, Seungwon Oh, Donghu Kim, Hojoon Lee, Kyung-Joong Kim
TL;DR
FIRE addresses the longstanding stability–plasticity tradeoff in continual learning by formulating reinitialization as a constrained optimization that minimizes the stability gap to past weights while enforcing isotropy to preserve plasticity. It introduces two differentiable metrics, Squared Frobenius Error $SFE$ and Deviation from Isometry $DfI$, and derives a principled projection onto an isotropic manifold using an orthogonal Procrustes view, approximated efficiently with a Newton–Schulz iteration. The method is validated across continual visual learning, continual pretraining of LLMs, and reinforcement learning, consistently beating naive training and standard reinitialization baselines with modest overhead. FIRE demonstrates that explicit control of the stability–plasticity tradeoff yields robust, transfer-friendly representations in nonstationary environments, with practical applicability across vision, language, and control domains.
Abstract
Deep neural networks trained on nonstationary data must balance stability (i.e., retaining prior knowledge) and plasticity (i.e., adapting to new tasks). Standard reinitialization methods, which reinitialize weights toward their original values, are widely used but difficult to tune: conservative reinitializations fail to restore plasticity, while aggressive ones erase useful knowledge. We propose FIRE, a principled reinitialization method that explicitly balances the stability-plasticity tradeoff. FIRE quantifies stability through Squared Frobenius Error (SFE), measuring proximity to past weights, and plasticity through Deviation from Isometry (DfI), reflecting weight isotropy. The reinitialization point is obtained by solving a constrained optimization problem, minimizing SFE subject to DfI being zero, which is efficiently approximated by Newton-Schulz iteration. FIRE is evaluated on continual visual learning (CIFAR-10 with ResNet-18), language modeling (OpenWebText with GPT-0.1B), and reinforcement learning (HumanoidBench with SAC and Atari games with DQN). Across all domains, FIRE consistently outperforms both naive training without intervention and standard reinitialization methods, demonstrating effective balancing of the stability-plasticity tradeoff.
