Table of Contents
Fetching ...

Stable-Drift: A Patient-Aware Latent Drift Replay Method for Stabilizing Representations in Continual Learning

Paraskevi-Antonia Theofilou, Anuhya Thota, Stefanos Kollias, Mamatha Thota

TL;DR

The paper tackles catastrophic forgetting in continual learning for medical imaging under cross-hospital domain shifts. It introduces a latent drift-guided replay framework with patient-aware, multi-layer drift to selectively replay the most fragile samples, preserving prior knowledge while adapting to new data. Across Swin Transformer and ResNet-50 backbones on a cross-hospital COVID-19 CT task, the method substantially reduces forgetting and achieves a favorable stability-plasticity balance compared with naive fine-tuning and random replay, using a compact 30k-sample buffer. These results advance practical continual learning for dynamic clinical environments, where robust generalization and data efficiency are critical.

Abstract

When deep learning models are sequentially trained on new data, they tend to abruptly lose performance on previously learned tasks, a critical failure known as catastrophic forgetting. This challenge severely limits the deployment of AI in medical imaging, where models must continually adapt to data from new hospitals without compromising established diagnostic knowledge. To address this, we introduce a latent drift-guided replay method that identifies and replays samples with high representational instability. Specifically, our method quantifies this instability via latent drift, the change in a sample internal feature representation after naive domain adaptation. To ensure diversity and clinical relevance, we aggregate drift at the patient level, our memory buffer stores the per patient slices exhibiting the greatest multi-layer representation shift. Evaluated on a cross-hospital COVID-19 CT classification task using state-of-the-art CNN and Vision Transformer backbones, our method substantially reduces forgetting compared to naive fine-tuning and random replay. This work highlights latent drift as a practical and interpretable replay signal for advancing robust continual learning in real world medical settings.

Stable-Drift: A Patient-Aware Latent Drift Replay Method for Stabilizing Representations in Continual Learning

TL;DR

The paper tackles catastrophic forgetting in continual learning for medical imaging under cross-hospital domain shifts. It introduces a latent drift-guided replay framework with patient-aware, multi-layer drift to selectively replay the most fragile samples, preserving prior knowledge while adapting to new data. Across Swin Transformer and ResNet-50 backbones on a cross-hospital COVID-19 CT task, the method substantially reduces forgetting and achieves a favorable stability-plasticity balance compared with naive fine-tuning and random replay, using a compact 30k-sample buffer. These results advance practical continual learning for dynamic clinical environments, where robust generalization and data efficiency are critical.

Abstract

When deep learning models are sequentially trained on new data, they tend to abruptly lose performance on previously learned tasks, a critical failure known as catastrophic forgetting. This challenge severely limits the deployment of AI in medical imaging, where models must continually adapt to data from new hospitals without compromising established diagnostic knowledge. To address this, we introduce a latent drift-guided replay method that identifies and replays samples with high representational instability. Specifically, our method quantifies this instability via latent drift, the change in a sample internal feature representation after naive domain adaptation. To ensure diversity and clinical relevance, we aggregate drift at the patient level, our memory buffer stores the per patient slices exhibiting the greatest multi-layer representation shift. Evaluated on a cross-hospital COVID-19 CT classification task using state-of-the-art CNN and Vision Transformer backbones, our method substantially reduces forgetting compared to naive fine-tuning and random replay. This work highlights latent drift as a practical and interpretable replay signal for advancing robust continual learning in real world medical settings.

Paper Structure

This paper contains 30 sections, 5 equations, 2 figures, 7 tables.

Figures (2)

  • Figure 1: Samples of CT slices from our datasets.
  • Figure 2: Stability–plasticity trade-off (H1 vs H2), dashed line denotes equal stability/plasticity.