Data-Driven Priors in the Maximum Entropy on the Mean Method for Linear Inverse Problems
Matthew King-Roskamp, Rustum Choksi, Tim Hoheisel
TL;DR
The paper develops a data-driven MEM framework for linear inverse problems, establishing almost-sure convergence of MEM solutions when priors are estimated empirically. By exploiting epi-convergence between log-moment generating functions and the MEM dual, it proves convergence of empirical MEM solutions $\overline{x}_{n,\varepsilon}$ to the true MEM solution $\overline{x}_{\mu}$ and derives explicit convergence rates in the quadratic-fidelity setting, including a $\mathcal{O}(n^{-1/4})$ rate in expectation for empirical priors. The approach hinges on epigraphical distances and epi-consistency of the MGF and dual objective, yielding rigorous guarantees for data-driven priors in linear inverse problems. Numerical denoising experiments on MNIST and Fashion-MNIST validate the theory and illustrate practical use, including measure-level postprocessing of the optimal prior.
Abstract
We establish the theoretical framework for implementing the maximumn entropy on the mean (MEM) method for linear inverse problems in the setting of approximate (data-driven) priors. We prove a.s. convergence for empirical means and further develop general estimates for the difference between the MEM solutions with different priors $μ$ and $ν$ based upon the epigraphical distance between their respective log-moment generating functions. These estimates allow us to establish a rate of convergence in expectation for empirical means. We illustrate our results with denoising on MNIST and Fashion-MNIST data sets.
