Linear regression with known noise distribution up to a scale: The reward of not using the OLSE
Fadoua Balabdaoui, Justine Leclerc
TL;DR
The paper studies linear regression when the error distribution is known up to a scale parameter and shows that the joint MLE of the regression vector and the scale is feasible, consistent, and asymptotically normal, with estimators that improve upon OLSE in efficiency. It derives the asymptotic relative efficiency, demonstrating that the MLE is at least as efficient as OLSE and often substantially more efficient for non-Gaussian noise, quantified for three scale-family distributions. Through simulations across multiple noise families and dimensions, and a real fish-market dataset, the authors illustrate the practical gains in estimation accuracy and variance reduction. The work provides a rigorous framework for ML-based estimation under scale-family noise, with clear guidelines on when leveraging the noise structure yields meaningful improvements in regression analysis.
Abstract
While the ordinary least squares estimator (OLSE) is still the most used estimator in linear regression models, other estimators can be more efficient when the error distribution is not Gaussian. In this paper, our goal is to evaluate this efficiency in the case of the Maximum Likelihood estimator (MLE) when the noise distribution belongs to a scale family. Under some regularity conditions, we show that (β_n,s_n), the MLE of the unknown regression vector β_0 and the scale s_0 exists and give the expression of the asymptotic efficiency of β_n over the OLSE. For given three scale families of densities, we quantify the true statistical gain of the MLE as a function of their deviation from the Gaussian family. To illustrate the theory, we present simulation results for different settings and also compare the MLE to the OLSE for the real market fish dataset.
