Table of Contents
Fetching ...

Is model selection possible for the $\ell_p$-loss? PCO estimation for regression models

Claire Lacour, Pascal Massart, Vincent Rivoirard

TL;DR

This work develops a model-selection framework for non-Euclidean losses in a sub-Gaussian sequence model by introducing the Penalized Comparison to Overfitting (PCO) procedure for weighted $\ell_p$ losses with $p\ge 1$. A general oracle inequality is established, with concentration results for sums of $|\xi_{\lambda}|^p$ via sub-Weibull bounds guiding penalty design; refined inequalities accommodate large model collections, revealing an elbow at $p=2$. The authors derive minimax convergence rates on Besov spaces for both constant and non-constant weights, identifying homogeneous, intermediate, and sparse regimes and proving adaptivity of the PCO estimator (including logarithmic losses in some cases). They extend the approach to nonparametric regression using wavelet representations, obtaining adaptive $\mathbb{L}_p$-risk rates for functions in Besov spaces and outlining the core steps of the functional-proof strategy. Overall, the paper offers a practically efficient single-minimization model-selection method that achieves near-optimal risk bounds for a broad class of Besov-sparse signals under non-Euclidean loss.

Abstract

This paper addresses the problem of model selection in the sequence model $Y=θ+\varepsilonξ$, when $ξ$ is sub-Gaussian, for non-euclidian loss-functions. In this model, the Penalized Comparison to Overfitting procedure is studied for the weighted $\ell_p$-loss, $p\geq 1.$ Several oracle inequalities are derived from concentration inequalities for sub-Weibull variables. Using judicious collections of models and penalty terms, minimax rates of convergence are stated for Besov bodies $\mathcal{B}_{r,\infty}^s$. These results are applied to the functional model of nonparametric regression.

Is model selection possible for the $\ell_p$-loss? PCO estimation for regression models

TL;DR

This work develops a model-selection framework for non-Euclidean losses in a sub-Gaussian sequence model by introducing the Penalized Comparison to Overfitting (PCO) procedure for weighted losses with . A general oracle inequality is established, with concentration results for sums of via sub-Weibull bounds guiding penalty design; refined inequalities accommodate large model collections, revealing an elbow at . The authors derive minimax convergence rates on Besov spaces for both constant and non-constant weights, identifying homogeneous, intermediate, and sparse regimes and proving adaptivity of the PCO estimator (including logarithmic losses in some cases). They extend the approach to nonparametric regression using wavelet representations, obtaining adaptive -risk rates for functions in Besov spaces and outlining the core steps of the functional-proof strategy. Overall, the paper offers a practically efficient single-minimization model-selection method that achieves near-optimal risk bounds for a broad class of Besov-sparse signals under non-Euclidean loss.

Abstract

This paper addresses the problem of model selection in the sequence model , when is sub-Gaussian, for non-euclidian loss-functions. In this model, the Penalized Comparison to Overfitting procedure is studied for the weighted -loss, Several oracle inequalities are derived from concentration inequalities for sub-Weibull variables. Using judicious collections of models and penalty terms, minimax rates of convergence are stated for Besov bodies . These results are applied to the functional model of nonparametric regression.

Paper Structure

This paper contains 34 sections, 21 theorems, 291 equations.

Key Result

Theorem 2.1

If $p>1$, for any arbitrary $m\in \mathcal{M}$, we have for any $\alpha\in (0,2)$: where $M_{p,\alpha}$ depends only on $p$ and $\alpha$. In particular, with $\alpha=1$, we obtain: where $M_p=M_{p,1}$ is given in Equation valeurdeM (see the proof). If $p=1$, the previous inequalities are true by replacing $\frac{2}{\alpha}$ by $\frac{1}{\alpha\wedge 1}$ in the right hand side of the first inequa

Theorems & Definitions (32)

  • Theorem 2.1
  • Remark 2.2
  • Theorem 2.3
  • Corollary 2.4
  • proof
  • Theorem 2.5
  • Remark 2.6
  • Theorem 2.7
  • Theorem 3.1
  • Theorem 3.2
  • ...and 22 more