Mutual Information Estimation via Normalizing Flows
Ivan Butakov, Alexander Tolmachev, Sofia Malanchuk, Anna Neopryatnaya, Alexey Frolov
TL;DR
This work tackles high-dimensional mutual information estimation by leveraging normalizing flows to map (X,Y) into latent representations where MI is more tractable. By comparing a general MIENF framework with Gaussian-base confinements, it derives closed-form MI expressions and non-asymptotic bounds, including a scalable, parameter-efficient tri-diagonal Gaussian variant. The proposed approach yields consistent MI estimates under suitable conditions and demonstrates competitive performance against established MI estimators on synthetic, high-dimensional data. The method offers a practical, low-variance MI estimator with theoretical guarantees, extendable to a broader class of base distributions and injective generative models for complex data.
Abstract
We propose a novel approach to the problem of mutual information (MI) estimation via introducing a family of estimators based on normalizing flows. The estimator maps original data to the target distribution, for which MI is easier to estimate. We additionally explore the target distributions with known closed-form expressions for MI. Theoretical guarantees are provided to demonstrate that our approach yields MI estimates for the original data. Experiments with high-dimensional data are conducted to highlight the practical advantages of the proposed method.
