Table of Contents
Fetching ...

Fisher-Guided Selective Forgetting: Mitigating The Primacy Bias in Deep Reinforcement Learning

Massimiliano Falzari, Matthia Sabatelli

TL;DR

PB in DRL causes early experiences to disproportionately shape learning, hindering adaptation. The authors introduce Fisher-Guided Selective Forgetting (FGSF), which uses the Fisher Information Matrix to identify PB dynamics and perform targeted, stochastic weight perturbations as a form of selective unlearning. Across DeepMind Control Suite tasks, FGSF improves final performance and stability relative to SAC and reset baselines, and shows robustness to replay ratios while revealing that the critic is more PB-prone than the actor. By marrying information geometry with machine unlearning, the work demonstrates a practical, geometry-aware approach to bias mitigation with potential for broader DRL applicability. While incurring modest computational overhead and requiring hyperparameter tuning, FGSF highlights a promising direction for bias-aware, data-efficient reinforcement learning and transferability of information-geometric ideas to learning dynamics.

Abstract

Deep Reinforcement Learning (DRL) systems often tend to overfit to early experiences, a phenomenon known as the primacy bias (PB). This bias can severely hinder learning efficiency and final performance, particularly in complex environments. This paper presents a comprehensive investigation of PB through the lens of the Fisher Information Matrix (FIM). We develop a framework characterizing PB through distinct patterns in the FIM trace, identifying critical memorization and reorganization phases during learning. Building on this understanding, we propose Fisher-Guided Selective Forgetting (FGSF), a novel method that leverages the geometric structure of the parameter space to selectively modify network weights, preventing early experiences from dominating the learning process. Empirical results across DeepMind Control Suite (DMC) environments show that FGSF consistently outperforms baselines, particularly in complex tasks. We analyze the different impacts of PB on actor and critic networks, the role of replay ratios in exacerbating the effect, and the effectiveness of even simple noise injection methods. Our findings provide a deeper understanding of PB and practical mitigation strategies, offering a FIM-based geometric perspective for advancing DRL.

Fisher-Guided Selective Forgetting: Mitigating The Primacy Bias in Deep Reinforcement Learning

TL;DR

PB in DRL causes early experiences to disproportionately shape learning, hindering adaptation. The authors introduce Fisher-Guided Selective Forgetting (FGSF), which uses the Fisher Information Matrix to identify PB dynamics and perform targeted, stochastic weight perturbations as a form of selective unlearning. Across DeepMind Control Suite tasks, FGSF improves final performance and stability relative to SAC and reset baselines, and shows robustness to replay ratios while revealing that the critic is more PB-prone than the actor. By marrying information geometry with machine unlearning, the work demonstrates a practical, geometry-aware approach to bias mitigation with potential for broader DRL applicability. While incurring modest computational overhead and requiring hyperparameter tuning, FGSF highlights a promising direction for bias-aware, data-efficient reinforcement learning and transferability of information-geometric ideas to learning dynamics.

Abstract

Deep Reinforcement Learning (DRL) systems often tend to overfit to early experiences, a phenomenon known as the primacy bias (PB). This bias can severely hinder learning efficiency and final performance, particularly in complex environments. This paper presents a comprehensive investigation of PB through the lens of the Fisher Information Matrix (FIM). We develop a framework characterizing PB through distinct patterns in the FIM trace, identifying critical memorization and reorganization phases during learning. Building on this understanding, we propose Fisher-Guided Selective Forgetting (FGSF), a novel method that leverages the geometric structure of the parameter space to selectively modify network weights, preventing early experiences from dominating the learning process. Empirical results across DeepMind Control Suite (DMC) environments show that FGSF consistently outperforms baselines, particularly in complex tasks. We analyze the different impacts of PB on actor and critic networks, the role of replay ratios in exacerbating the effect, and the effectiveness of even simple noise injection methods. Our findings provide a deeper understanding of PB and practical mitigation strategies, offering a FIM-based geometric perspective for advancing DRL.

Paper Structure

This paper contains 20 sections, 5 equations, 13 figures, 3 tables, 1 algorithm.

Figures (13)

  • Figure 2: Learning curves showing episodic reward across different environments for baseline SAC (gold), reset method (teal), and FGSF (red). Shaded regions represent the minimum and maximum over 5 random seeds. Best viewed in colors.
  • Figure 3: Evolution of FIM trace (Tr(F)) for actor networks. Results compare baseline SAC (gold), reset method (teal), and FGSF (red). Shaded regions represent the minimum and maximum over 5 random seeds. Best viewed in colors.
  • Figure 4: Hyperparameter sensitivity analysis showing performance across different scrubbing coefficients ($\lambda \in [5 \times 10^{-6},5 \times 10^{-8}]$ ). The lighter the color the higher the coefficient. Shaded regions represent the minimum and maximum over 5 random seeds. Best viewed in colors.
  • Figure 5: Performance comparison and FIM trace evolution on the Quadruped under different replay ratios (2 left column and 4 right column) for baseline SAC, reset method, and FGSF. Higher replay ratios amplify the differences between methods. Best viewed in colors.
  • Figure 6: Learning curves showing episodic reward between critic-only scrubbing (red) and full network scrubbing (teal) for the different environments. Best viewed in colors.
  • ...and 8 more figures