Table of Contents
Fetching ...

Adaptive minimax optimality in statistical inverse problems via SOLIT -- Sharp Optimal Lepskii-Inspired Tuning

Housen Li, Frank Werner

TL;DR

This work addresses adaptive parameter selection in filter-based regularization for statistical inverse problems, introducing SOLIT, a Lepski-inspired tuning rule that relies only on data, the noise level, T, and the filter qα. The authors establish an oracle inequality for the MSE and prove that SOLIT attains adaptive minimax convergence rates across mildly and, under certain conditions, severely ill-posed settings, without incurring extra logarithmic penalties. They provide a practical implementation framework, including line-search-based candidate construction and quantile-based thresholds, and demonstrate through numerical experiments that SOLIT closely matches oracle performance across diverse ill-posedness regimes. The results suggest SOLIT as a robust, scalable, and tuning-free approach for adaptive inverse problem solving with broad applicability, while highlighting open questions for severe ill-posedness and non-Gaussian noise. Overall, SOLIT challenges the conventional belief that Lepski-type methods necessarily incur log-factor losses, delivering sharp adaptive rates in many scenarios and offering practical computational strategies for large-scale problems.

Abstract

We consider statistical linear inverse problems in separable Hilbert spaces and filter-based reconstruction methods of the form $\hat f_α= q_α\left(T^*T\right)T^*Y$, where $Y$ is the available data, $T$ the forward operator, $\left(q_α\right)_{α\in \mathcal A}$ an ordered filter, and $α> 0$ a regularization parameter. Whenever such a method is used in practice, $α$ has to be appropriately chosen. Typically, the aim is to find or at least approximate the best possible $α$ in the sense that mean squared error (MSE) $\mathbb E [\Vert \hat f_α- f^\dagger\Vert^2]$ w.r.t.~the true solution $f^\dagger$ is minimized. In this paper, we introduce the Sharp Optimal Lepskiĭ-Inspired Tuning (SOLIT) method, which yields an a posteriori parameter choice rule ensuring adaptive minimax rates of convergence. It depends only on $Y$ and the noise level $σ$ as well as the operator $T$ and the filter $\left(q_α\right)_{α\in \mathcal A}$ and does not require any problem-dependent tuning of further parameters. We prove an oracle inequality for the corresponding MSE in a general setting and derive the rates of convergence in different scenarios. By a careful analysis we show that no other a posteriori parameter choice rule can yield a better performance in terms of the order of the convergence rate of the MSE. In particular, our results reveal that the typical understanding of Lepski\uı-type methods in inverse problems leading to a loss of a log factor is wrong. In addition, the empirical performance of SOLIT is examined in simulations.

Adaptive minimax optimality in statistical inverse problems via SOLIT -- Sharp Optimal Lepskii-Inspired Tuning

TL;DR

This work addresses adaptive parameter selection in filter-based regularization for statistical inverse problems, introducing SOLIT, a Lepski-inspired tuning rule that relies only on data, the noise level, T, and the filter qα. The authors establish an oracle inequality for the MSE and prove that SOLIT attains adaptive minimax convergence rates across mildly and, under certain conditions, severely ill-posed settings, without incurring extra logarithmic penalties. They provide a practical implementation framework, including line-search-based candidate construction and quantile-based thresholds, and demonstrate through numerical experiments that SOLIT closely matches oracle performance across diverse ill-posedness regimes. The results suggest SOLIT as a robust, scalable, and tuning-free approach for adaptive inverse problem solving with broad applicability, while highlighting open questions for severe ill-posedness and non-Gaussian noise. Overall, SOLIT challenges the conventional belief that Lepski-type methods necessarily incur log-factor losses, delivering sharp adaptive rates in many scenarios and offering practical computational strategies for large-scale problems.

Abstract

We consider statistical linear inverse problems in separable Hilbert spaces and filter-based reconstruction methods of the form , where is the available data, the forward operator, an ordered filter, and a regularization parameter. Whenever such a method is used in practice, has to be appropriately chosen. Typically, the aim is to find or at least approximate the best possible in the sense that mean squared error (MSE) w.r.t.~the true solution is minimized. In this paper, we introduce the Sharp Optimal Lepskiĭ-Inspired Tuning (SOLIT) method, which yields an a posteriori parameter choice rule ensuring adaptive minimax rates of convergence. It depends only on and the noise level as well as the operator and the filter and does not require any problem-dependent tuning of further parameters. We prove an oracle inequality for the corresponding MSE in a general setting and derive the rates of convergence in different scenarios. By a careful analysis we show that no other a posteriori parameter choice rule can yield a better performance in terms of the order of the convergence rate of the MSE. In particular, our results reveal that the typical understanding of Lepski\uı-type methods in inverse problems leading to a loss of a log factor is wrong. In addition, the empirical performance of SOLIT is examined in simulations.
Paper Structure (27 sections, 8 theorems, 94 equations, 4 figures)

This paper contains 27 sections, 8 theorems, 94 equations, 4 figures.

Key Result

Lemma 3.1

Let $q_{\alpha}$ be an ordered filter such that some index function $\varphi$ is a qualification. Then the function $V_{T,q} : \mathcal{A} \to \mathbb R$ obeys the following properties:

Figures (4)

  • Figure 1: Simulation results for the antiderivate problem. Depicted are different noise levels $\sigma$ (x-axis) against empirical mean squared errors ${\mathbb E}\left[ \left\Vert \widehat{f}_\alpha - f\right\Vert_{\mathcal{X}}^2 \right]$ simulated in $M = 10^4$ Monte Carlo runs, for different parameter choices $\alpha$, namely SOLIT (eq. \ref{['e:limit:a']}--\ref{['e:limit:d']}, \ref{['sho_antider_solit']}), optimal alpha (eq. \ref{['eq:alpha_opt']}, \ref{['sho_antider_opt']}) and the oracle choice (eq. \ref{['e:orcr']}, \ref{['sho_antider_oracle']}). Furthermore shown is the empirical price of adaptation (eq. \ref{['eq:poa']}, \ref{['sho_antider_poa']}), and a slope (\ref{['sho_antider_orc']}) indicating the optimal rate of convergence $\mathcal{O} \left(\sigma^{3/4}\right)$.
  • Figure 2: Results for the gradiometry problem. Depicted are different noise levels $\sigma$ (x-axis) against empirical mean squared errors ${\mathbb E}\left[ \left\Vert \widehat{f}_\alpha - f\right\Vert_{\mathcal{X}}^2 \right]$ simulated in $M = 10^4$ Monte Carlo runs, for different parameter choices $\alpha$, namely SOLIT (eq. \ref{['e:limit:a']}--\ref{['e:limit:d']}, \ref{['sho_grad_solit']}), optimal alpha (eq. \ref{['eq:alpha_opt']}, \ref{['sho_grad_opt']}) and the oracle choice (eq. \ref{['e:orcr']}, \ref{['sho_grad_oracle']}). Furthermore shown is the empirical price of adaptation (eq. \ref{['eq:poa']}, \ref{['sho_grad_poa']}) and a slope (\ref{['sho_grad_orc']}) indicating the optimal rate of convergence $\mathcal{O} \left(\left(-\log\sigma\right)^{-3}\right)$.
  • Figure 3: Results for the heat problem. Depicted are different noise levels $\sigma$ (x-axis) against empirical mean squared errors ${\mathbb E}\left[ \left\Vert \widehat{f}_\alpha - f\right\Vert_{\mathcal{X}}^2 \right]$ simulated in $M = 10^4$ Monte Carlo runs, for different parameter choices $\alpha$, namely SOLIT (eq. \ref{['e:limit:a']}--\ref{['e:limit:d']}, \ref{['sho_heat_solit']}), optimal alpha (eq. \ref{['eq:alpha_opt']}, \ref{['sho_heat_opt']}) and the oracle choice (eq. \ref{['e:orcr']}, \ref{['sho_heat_oracle']}). Furthermore shown is the empirical price of adaptation (eq. \ref{['eq:poa']}, \ref{['sho_heat_poa']}) and a slope (\ref{['sho_heat_ocr']}) indicating the optimal rate of convergence $\mathcal{O} \left(\left(-\log\sigma\right)^{-\frac{3}{2}}\right)$.
  • Figure 4: Comparison of SOLIT with the classical Lepskiı-type balancing principle \ref{['eq:lepskij']} for all three different testing problems with Tikhonov regularization. Depicted are different noise levels $\sigma$ (x-axis) against empirical mean squared errors ${\mathbb E}\left[ \left\Vert \widehat{f}_\alpha - f\right\Vert_{\mathcal{X}}^2 \right]$ simulated in $M = 10^4$ Monte Carlo runs, for different parameter choices $\alpha$, namely, SOLIT (eq. \ref{['e:limit:a']}--\ref{['e:limit:d']}, \ref{['sho_heat_solit']}), Lepskiı (eq. \ref{['eq:lepskij']} with $\kappa = 1$, \ref{['lep']}) and the optimal rate of convergence (\ref{['sho_heat_ocr']}).

Theorems & Definitions (18)

  • Definition 1: Ordered filters
  • Lemma 3.1
  • proof
  • Lemma 3.2
  • proof
  • Definition 2: SOLIT
  • Theorem 3.3: Oracle inequality SpWi19
  • proof
  • Lemma 3.4: Bias--variance decomposition BHMR07
  • proof
  • ...and 8 more