MINDE: Mutual Information Neural Diffusion Estimation
Giulio Franzese, Mustapha Bounoua, Pietro Michiardi
TL;DR
MINDE introduces a diffusion-based framework for mutual information estimation by expressing KL divergence as a difference of score functions via a Girsanov-inspired view. It provides two modeling directions—conditional diffusion and joint diffusion—and a family of estimators that combine score networks with entropy calculations, achieving strong accuracy on challenging distributions and passing self-consistency tests. The method demonstrates robustness across high-dimensional, transformed, and sparse-interaction scenarios and offers potential for scaling to multi-modal and multi-variable MI analysis. Overall, MINDE expands the information-theoretic toolkit for neural estimators by leveraging score-based diffusion, enabling precise MI and entropy estimates in complex data settings.
Abstract
In this work we present a new method for the estimation of Mutual Information (MI) between random variables. Our approach is based on an original interpretation of the Girsanov theorem, which allows us to use score-based diffusion models to estimate the Kullback Leibler divergence between two densities as a difference between their score functions. As a by-product, our method also enables the estimation of the entropy of random variables. Armed with such building blocks, we present a general recipe to measure MI, which unfolds in two directions: one uses conditional diffusion process, whereas the other uses joint diffusion processes that allow simultaneous modelling of two random variables. Our results, which derive from a thorough experimental protocol over all the variants of our approach, indicate that our method is more accurate than the main alternatives from the literature, especially for challenging distributions. Furthermore, our methods pass MI self-consistency tests, including data processing and additivity under independence, which instead are a pain-point of existing methods.
