Hierarchical Uncertainty Exploration via Feedforward Posterior Trees
Elias Nehme, Rotem Mulayoff, Tomer Michaeli
TL;DR
Ill-posed imaging inverse problems yield multi-modal posteriors that are difficult to inspect. The authors introduce posterior trees, a single-pass neural model that outputs a depth-$d$, $K$-ary tree of prototypes and associated probabilities to visualize $p_{\mathbf{x}|\mathbf{y}}(\mathbf{x}|\mathbf{y})$ at multiple granularities. The method builds a bottom-up tree whose root matches the MMSE estimator $\hat{\mathbf{x}}_{\mathrm{MMSE}}=\mathbb{E}[\mathbf{x}|\mathbf{y}]$, using an amortized oracle loss and an annealed regularization to prevent collapse, plus a weighted sampling scheme to handle imbalanced posteriors. Across denoising, colorization, inpainting, and bioimage translation, posterior trees achieve competitive performance with diffusion-based baselines while offering dramatically faster inference ($\approx$7 ms per image on a GPU) and enabling efficient, interactive uncertainty exploration in practice.
Abstract
When solving ill-posed inverse problems, one often desires to explore the space of potential solutions rather than be presented with a single plausible reconstruction. Valuable insights into these feasible solutions and their associated probabilities are embedded in the posterior distribution. However, when confronted with data of high dimensionality (such as images), visualizing this distribution becomes a formidable challenge, necessitating the application of effective summarization techniques before user examination. In this work, we introduce a new approach for visualizing posteriors across multiple levels of granularity using tree-valued predictions. Our method predicts a tree-valued hierarchical summarization of the posterior distribution for any input measurement, in a single forward pass of a neural network. We showcase the efficacy of our approach across diverse datasets and image restoration challenges, highlighting its prowess in uncertainty quantification and visualization. Our findings reveal that our method performs comparably to a baseline that hierarchically clusters samples from a diffusion-based posterior sampler, yet achieves this with orders of magnitude greater speed.
