$L^1$ Estimation: On the Optimality of Linear Estimators
Leighton P. Barnes, Alex Dytso, Jingbo Liu, H. Vincent Poor
TL;DR
This work characterizes when the Bayes estimator under the $L^1$ loss is linear in the observation for the Gaussian-noise model. Using a convolution formulation and Fourier analysis on tempered distributions, it proves that the conditional median is linear if and only if $a\in[0,1)$ and $X$ is Gaussian with variance $\frac{a}{1-a}$; positivity of the associated measure is crucial to the analysis. The study reveals a phase transition across $L^p$ losses, with Gaussian priors uniquely inducing linearity for $p\in[1,2]$ and infinitely many priors enabling linearity for $p>2$, and it discusses extensions to Poisson-like noise, exponential-family models, and higher-dimensional settings, highlighting practical implications for prior design and estimator benchmarks in Bayesian estimation. Overall, the results complete the understanding of when linear Bayesian estimators arise under $L^1$ fidelity and connect to broader themes in convolution equations and exponential-family conjugacy.
Abstract
Consider the problem of estimating a random variable $X$ from noisy observations $Y = X+ Z$, where $Z$ is standard normal, under the $L^1$ fidelity criterion. It is well known that the optimal Bayesian estimator in this setting is the conditional median. This work shows that the only prior distribution on $X$ that induces linearity in the conditional median is Gaussian. Along the way, several other results are presented. In particular, it is demonstrated that if the conditional distribution $P_{X|Y=y}$ is symmetric for all $y$, then $X$ must follow a Gaussian distribution. Additionally, we consider other $L^p$ losses and observe the following phenomenon: for $p \in [1,2]$, Gaussian is the only prior distribution that induces a linear optimal Bayesian estimator, and for $p \in (2,\infty)$, infinitely many prior distributions on $X$ can induce linearity. Finally, extensions are provided to encompass noise models leading to conditional distributions from certain exponential families.
