Statistical mechanics of the maximum-average submatrix problem
Vittorio Erba, Florent Krzakala, Rodrigo Pérez, Lenka Zdeborová
TL;DR
This work reframes the maximum-average submatrix problem as a mean-field spin-glass model with fixed magnetization, enabling a full replica analysis of its thermodynamics. By solving a 1-RSB variational problem and inspecting stability, the authors map out a rich phase diagram in the submatrix size $m$ and average $a$, including RS, dynamical/ static 1-RSB, full-RSB, and UNSAT phases, with a frozen 1-RSB regime emerging as $m\to0$. They connect the $m\to0$ limit to REM-like behavior, link dynamic/static thresholds to known LAS results, and reveal algorithmic implications where methods like Incremental Greedy Procedure can succeed within certain frozen phases. The study also extends to rectangular MAS via bipartite SK mappings and shows, in the square case, equivalence with the symmetric principal MAS analysis, while leveraging Panchenko’s full-RSB framework for exact asymptotics. Overall, the work provides a rigorous statistical-mechanics portrait of MAS, illuminating phase structure, algorithmic regimes, and cross-model universality that informs both theory and potential heuristics for large-scale biclustering tasks.
Abstract
We study the maximum-average submatrix problem, in which given an $N \times N$ matrix $J$ one needs to find the $k \times k$ submatrix with the largest average of entries. We study the problem for random matrices $J$ whose entries are i.i.d. random variables by mapping it to a variant of the Sherrington-Kirkpatrick spin-glass model at fixed magnetization. We characterize analytically the phase diagram of the model as a function of the submatrix average and the size of the submatrix $k$ in the limit $N\to\infty$. We consider submatrices of size $k = m N$ with $0 < m < 1$. We find a rich phase diagram, including dynamical, static one-step replica symmetry breaking and full-step replica symmetry breaking. In the limit of $m \to 0$, we find a simpler phase diagram featuring a frozen 1-RSB phase, where the Gibbs measure is composed of exponentially many pure states each with zero entropy. We discover an interesting phenomenon, reminiscent of the phenomenology of the binary perceptron: there exist efficient algorithms that provably work in the frozen 1-RSB phase.
