Towards Healing the Blindness of Score Matching
Mingtian Zhang, Oscar Key, Peter Hayes, David Barber, Brooks Paige, François-Xavier Briol
TL;DR
The paper tackles the blindness of score-based divergences in multi-modal settings by introducing the Mixture Fisher Divergence (MFD), which bridges disconnected supports with a mixing density $m$ and mixing weight $\beta$. It proves MFD is a valid divergence on disconnected supports and demonstrates its practical benefits in density estimation by first training a bridged model on $\tilde{p}_d=\beta p_d+(1-\beta)m$ and then applying a correction step to recover the true density. In the energy-based-model setting, the authors propose a three-step pipeline that avoids explicit normalization during training, yielding substantial improvements in KL divergence over standard FD on mixtures of Gaussians and concentric circles. Overall, MFD provides a principled way to extend score-based divergences to multi-modal distributions, offering a practical path to more robust density estimation and related score-based tasks.
Abstract
Score-based divergences have been widely used in machine learning and statistics applications. Despite their empirical success, a blindness problem has been observed when using these for multi-modal distributions. In this work, we discuss the blindness problem and propose a new family of divergences that can mitigate the blindness problem. We illustrate our proposed divergence in the context of density estimation and report improved performance compared to traditional approaches.
