An Achievable and Analytic Solution to Information Bottleneck for Gaussian Mixtures
Yi Song, Kai Wan, Zhenyu Liao, Giuseppe Caire
TL;DR
This work addresses an information bottleneck problem arising in a remote source coding setup where a binary source $Y$ is observed through a Gaussian mixture $X$, and an intermediate node compresses the observation into $T$ under a rate constraint $I(X; T) \le R$ to maximize $I(Y; T)$ under log-loss. It proposes three analytically tractable IB schemes—two-level random quantization, multi-level deterministic quantization, and soft quantization using $\tanh$—as achievable solutions, with BA serving as a numerical optimum benchmark and information dropout as a competing approach. The results show that the proposed schemes achieve near-optimal performance across SNRs, outperform the information dropout baseline, and extend to vector mixture Gaussian observations, with applications to binary classification under information leakage and MNIST-based validation. The paper also connects IB to remote source coding and demonstrates practical, closed-form strategies for efficient, privacy-aware information extraction relevant to communications and learning systems.
Abstract
In this paper, we study a remote source coding scenario in which binary phase shift keying (BPSK) modulation sources are corrupted by additive white Gaussian noise (AWGN). An intermediate node, such as a relay, receives these observations and performs additional compression to balance complexity and relevance. This problem can be further formulated as an information bottleneck (IB) problem with Bernoulli sources and Gaussian mixture observations. However, no closed-form solution exists for this IB problem. To address this challenge, we propose a unified achievable scheme that employs three different compression/quantization strategies for intermediate node processing by using two-level quantization, multi-level deterministic quantization, and soft quantization with the hyperbolic tangent ($\tanh$) function, respectively. In addition, we extend our analysis to the vector mixture Gaussian observation problem and explore its application in machine learning for binary classification with information leakage. Numerical evaluations show that the proposed scheme has a near-optimal performance over various signal-to-noise ratios (SNRs), compared to the Blahut-Arimoto (BA) algorithm, and has better performance than some existing numerical methods such as the information dropout approach. Furthermore, experiments conducted on the realistic MNIST dataset also validate the superior classification accuracy of our method compared to the information dropout approach.
