Non-Bayesian Post-Model-Selection Estimation as Estimation Under Model Misspecification

Nadav Harel; Tirza Routtenberg

Non-Bayesian Post-Model-Selection Estimation as Estimation Under Model Misspecification

Nadav Harel, Tirza Routtenberg

TL;DR

This work addresses estimation after model selection when the true model is unknown and may be misspecified. It formalizes three interpretations of post-model-selection estimation as misspecification problems (naive, normalized, selective inference), derives corresponding misspecified ML estimators and pseudo-true parameter vectors, and introduces the PS-MCRB to bound estimation performance. The paper shows that the selective-inference interpretation yields coherent likelihoods and the PSML estimator, achieving the lowest MSE and providing the most informative bounds compared to the oracle CRB. Through a Gaussian linear-model example with channel-detection, the authors demonstrate that MSNL and PSML outperform the naive MSL approach, with PS-MCRBs offering tighter performance guarantees aligned with the selection procedure. Overall, the selective-inference framework provides a practical, theoretically grounded approach to post-model-selection estimation under misspecification with meaningful implications for signal processing and related domains.

Abstract

In many parameter estimation problems, the exact model is unknown and is assumed to belong to a set of candidate models. In such cases, a predetermined data-based selection rule selects a parametric model from a set of candidates before the parameter estimation. The existing framework for estimation under model misspecification does not account for the selection process that led to the misspecified model. Moreover, in post-model-selection estimation, there are multiple candidate models chosen based on the observations, making the interpretation of the assumed model in the misspecified setting non-trivial. In this work, we present three interpretations to address the problem of non-Bayesian post-model-selection estimation as an estimation under model misspecification problem: the naive interpretation, the normalized interpretation, and the selective inference interpretation, and discuss their properties. For each of these interpretations, we developed the corresponding misspecified maximum likelihood estimator and the misspecified Cram$\acute{\text{e}}$r-Rao-type lower bound. The relations between the estimators and the performance bounds, as well as their properties, are discussed. Finally, we demonstrate the performance of the proposed estimators and bounds via simulations of estimation after channel selection. We show that the proposed performance bounds are more informative than the oracle Cram$\acute{\text{e}}$r-Rao Bound (CRB), where the third interpretation (selective inference) results in the lowest mean-squared-error (MSE) among the estimators.

Non-Bayesian Post-Model-Selection Estimation as Estimation Under Model Misspecification

TL;DR

Abstract

r-Rao-type lower bound. The relations between the estimators and the performance bounds, as well as their properties, are discussed. Finally, we demonstrate the performance of the proposed estimators and bounds via simulations of estimation after channel selection. We show that the proposed performance bounds are more informative than the oracle Cram

r-Rao Bound (CRB), where the third interpretation (selective inference) results in the lowest mean-squared-error (MSE) among the estimators.

Paper Structure (34 sections, 2 theorems, 102 equations, 3 figures)

This paper contains 34 sections, 2 theorems, 102 equations, 3 figures.

Introduction
Background
Post-Model-Selection Estimation
Model: Estimation After Model Selection
Post-Model-Selection Estimation as Estimation Under model misspecification
Naive Interpretation
Normalized Interpretation
Selective Inference Interpretation
Post-Model-Selection Estimators
Maximum Selected Likelihood (MSL) Estimator
Maximum Selected Normalized Likelihood (MSNL) Estimator
Post-Selection ML (PSML) Estimator
Post-Model-Selection Performance Analysis
Post-model-selection MSE
Post-Model-Selection Pseudo-True Parameter Vectors
...and 19 more sections

Key Result

Theorem 1

($k$th PS-MCRB) Let $f({\bf{x}};\boldsymbol \theta)$ be a general assumed pdf for a post-model-selection scheme with a selection rule $\Psi$ that satisfy cond1cond2cond3cond4. The $k$th-MSE of any finite variance, PSMS-unbiased estimator, $\hat{\boldsymbol \theta}^{(k)}$, satisfies where the $k$th-PS-MCRB is given by the pseudo-true parameter vector $\boldsymbol \vartheta^{(k)}$ and the post-mod

Figures (3)

Figure 1: Post-model-selection estimation scheme: first, a model is selected from a pool of candidate models based on a predetermined selection rule. Second, the unknown parameters of the selected model are estimated. Both stages are performed based on the same observation vector, ${\bf{x}}$.
Figure 2: The MSE of the MSL, MSNL, PSML, and the oracle MLs estimators, and PS-MCEBs and the oracle CRB versus the threshold, $\gamma$, where the true hypothesis is ${\cal H}_1$.
Figure 3: The bias (a) and MSE (b) of the MSL, MSNL, PSML, and the oracle MLs estimators, and PS-MCEBs and the oracle CRB (b) versus the threshold, $\gamma$, where the true hypothesis is ${\cal H}_2$.

Theorems & Definitions (14)

Definition 1
Definition 2
Remark 1
Remark 2
Remark 3
Definition 3
Definition 4
Claim 1
Claim 2
Theorem 1
...and 4 more

Non-Bayesian Post-Model-Selection Estimation as Estimation Under Model Misspecification

TL;DR

Abstract

Non-Bayesian Post-Model-Selection Estimation as Estimation Under Model Misspecification

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (14)