A Thermodynamic Theory of Learning Part II: Critical Period Closure and Continual Learning Failure
Daisuke Okanohara
TL;DR
This work develops a thermodynamic view of continual learning, treating learning as a finite-time transport in parameter distributions and showing that irreversibility, quantified by entropy production, constrains not only final solutions but the entire learning trajectory. Building on the Epistemic Speed Limit, the paper introduces a trajectory-level analysis with a free-energy Lyapunov framework and a geometric notion of task-preserving entropy, revealing how excess dissipation selectively erodes task-equivariant degrees of freedom and leads to critical period closure. It formalizes a dissipation-driven reachability obstruction: after a reference task, a bounded dissipation budget restricts future learning to a Wasserstein ball around the initial distribution, making some new-task regions dynamically inaccessible and causing forgetting if those regions lie outside the reachable set. The results reinterpret catastrophic forgetting as an irreversible dynamical constraint under finite-time learning, offering principles for preserving task-equivalent directions and guiding future algorithm design within non-equilibrium thermodynamic limits.
Abstract
Learning performed over finite time is necessarily irreversible. In Part~I of this series, we modeled learning as a transport process in the space of parameter distributions and derived the Epistemic Speed Limit, which lower-bounds entropy production under finite-time learning. In this work (Part~II), we study the consequences of this irreversibility for continual learning from a trajectory-level perspective. We show that finite dissipation constrains not only which solutions are reachable, but which learning paths remain dynamically accessible. Although a continuum of task-equivalent realizations can achieve identical task performance, finite-time learning irreversibly selects among these realizations. This selection occurs through the progressive elimination of degrees of freedom that would otherwise enable structural reconfiguration. We refer to this phenomenon as \emph{critical period closure}: beyond a certain stage of learning, transitions between compatible representations become dynamically inaccessible under any finite dissipation budget. As a result, continual learning failure arises not from the absence of solutions satisfying multiple tasks, but from an irreversible loss of representational freedom induced by prior learning. This reframes catastrophic forgetting as a dynamical constraint imposed by finite-time dissipation, rather than direct task interference.
