Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation
Shuyu Yin, Fei Wen, Peilin Liu, Tao Luo
TL;DR
The paper addresses the problem that semi-gradient Q-learning lacks an explicit loss function, hindering analysis of implicit bias. It constructs an effective loss landscape using the Fokker–Planck equation (Wang's potential landscape) from partial data to visualize training dynamics and compare negative semi-gradient and residual-gradient forces. The key finding is that global minima of the true Bellman loss can become saddle points in the effective landscape, and this bias persists in high-dimensional neural networks, as shown in both small 2D examples and more realistic grid-world/DQN settings. The authors propose a three-step approach to probe implicit bias and provide public code to reproduce the visualizations, highlighting implications for understanding and diagnosing bias in semi-gradient RL methods.
Abstract
Semi-gradient Q-learning is applied in many fields, but due to the absence of an explicit loss function, studying its dynamics and implicit bias in the parameter space is challenging. This paper introduces the Fokker--Planck equation and employs partial data obtained through sampling to construct and visualize the effective loss landscape within a two-dimensional parameter space. This visualization reveals how the global minima in the loss landscape can transform into saddle points in the effective loss landscape, as well as the implicit bias of the semi-gradient method. Additionally, we demonstrate that saddle points, originating from the global minima in loss landscape, still exist in the effective loss landscape under high-dimensional parameter spaces and neural network settings. This paper develop a novel approach for probing implicit bias in semi-gradient Q-learning.
