On the phase diagram of extensive-rank symmetric matrix denoising beyond rotational invariance

Jean Barbier; Francesco Camilli; Justin Ko; Koki Okajima

On the phase diagram of extensive-rank symmetric matrix denoising beyond rotational invariance

Jean Barbier, Francesco Camilli, Justin Ko, Koki Okajima

TL;DR

The paper investigates Bayesian matrix denoising for an extensive-rank signal ${\mathbf X}{\mathbf X}^{\intercal}$ without rotational invariance, aiming to map its information-theoretic limits via a phase diagram. It develops a novel multiscale mean-field framework that blends cavity-method reductions with effective scalar inference to compute the MMSE and MI, and introduces a complete ansatz that unifies universal (matrix-model) and non-universal (factorisation) regimes. A key finding is a denoising-factorisation transition along a line $\lambda_c(\alpha)$, with universality holding below this line and breaking beyond it for discrete priors, implying algorithmic hardness in the non-universal phase. The framework connects to replica calculations and HCIZ-based matrix-model results in the denoising phase, while providing a rigorous mean-field description in the factorisation phase, offering insights into when full factorisation of ${\mathbf X}$ is information-theoretically possible and how it may be achieved or approximated in practice.

Abstract

Matrix denoising is central to signal processing and machine learning. Its statistical analysis when the matrix to infer has a factorised structure with a rank growing proportionally to its dimension remains a challenge, except when it is rotationally invariant. In this case the information theoretic limits and an efficient Bayes-optimal denoising algorithm, called rotational invariant estimator [1,2], are known. Beyond this setting few results can be found. The reason is that the model is not a usual spin system because of the growing rank dimension, nor a matrix model (as appearing in high-energy physics) due to the lack of rotation symmetry, but rather a hybrid between the two. Here we make progress towards the understanding of Bayesian matrix denoising when the signal is a factored matrix $XX^\intercal$ that is not rotationally invariant. Monte Carlo simulations suggest the existence of a \emph{denoising-factorisation transition} separating a phase where denoising using the rotational invariant estimator remains Bayes-optimal due to universality properties of the same nature as in random matrix theory, from one where universality breaks down and better denoising is possible, though algorithmically hard. We argue that it is only beyond the transition that factorisation, i.e., estimating $X$ itself, becomes possible up to irresolvable ambiguities. On the theory side, we combine mean-field techniques in an interpretable multiscale fashion in order to access the minimum mean-square error and mutual information. Interestingly, our alternative method yields equations reproducible by the replica approach of [3]. Using numerical insights, we delimit the portion of phase diagram where we conjecture the mean-field theory to be exact, and correct it using universality when it is not. Our complete ansatz matches well the numerics in the whole phase diagram when considering finite size effects.

On the phase diagram of extensive-rank symmetric matrix denoising beyond rotational invariance

TL;DR

The paper investigates Bayesian matrix denoising for an extensive-rank signal

without rotational invariance, aiming to map its information-theoretic limits via a phase diagram. It develops a novel multiscale mean-field framework that blends cavity-method reductions with effective scalar inference to compute the MMSE and MI, and introduces a complete ansatz that unifies universal (matrix-model) and non-universal (factorisation) regimes. A key finding is a denoising-factorisation transition along a line

, with universality holding below this line and breaking beyond it for discrete priors, implying algorithmic hardness in the non-universal phase. The framework connects to replica calculations and HCIZ-based matrix-model results in the denoising phase, while providing a rigorous mean-field description in the factorisation phase, offering insights into when full factorisation of

is information-theoretically possible and how it may be achieved or approximated in practice.

Abstract

that is not rotationally invariant. Monte Carlo simulations suggest the existence of a \emph{denoising-factorisation transition} separating a phase where denoising using the rotational invariant estimator remains Bayes-optimal due to universality properties of the same nature as in random matrix theory, from one where universality breaks down and better denoising is possible, though algorithmically hard. We argue that it is only beyond the transition that factorisation, i.e., estimating

itself, becomes possible up to irresolvable ambiguities. On the theory side, we combine mean-field techniques in an interpretable multiscale fashion in order to access the minimum mean-square error and mutual information. Interestingly, our alternative method yields equations reproducible by the replica approach of [3]. Using numerical insights, we delimit the portion of phase diagram where we conjecture the mean-field theory to be exact, and correct it using universality when it is not. Our complete ansatz matches well the numerics in the whole phase diagram when considering finite size effects.

On the phase diagram of extensive-rank symmetric matrix denoising beyond rotational invariance

TL;DR

Abstract

On the phase diagram of extensive-rank symmetric matrix denoising beyond rotational invariance

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (16)

Theorems & Definitions (3)