On (in)consistency of M-estimators under contamination
Jens Klooster, Bent Nielsen
TL;DR
This paper studies the robustness of M-estimators for location and scale under fixed contamination, showing that non-redescending estimators like the median and Huber can be inconsistent under asymmetric contamination while redescending estimators like Tukey can be consistent when the scale is estimated consistently. It develops an asymptotic framework under a contamination model, deriving boundedness results, an oracle property for redescending estimators, and explicit inconsistency results for non-redescending ones; it also demonstrates that standard robust scale estimators (IQR, MAD) are themselves inconsistent under contamination. The practical implication is that valid inference under contamination requires modelling the contamination and/or using alternatives such as Least Trimmed Squares (LTS), which exhibits an oracle property and nuisance-parameter-free inference under the contamination model. The simulations corroborate that Tukey with a consistent scale can achieve consistency, but LTS often provides superior finite-sample performance across contaminated scenarios, underscoring the value of robust, contamination-aware approaches in location-scale problems.
Abstract
We consider robust location-scale estimators under contamination. We show that commonly used robust estimators such as the median and the Huber estimator are inconsistent under asymmetric contamination, while the Tukey estimator is consistent. In order to make nuisance parameter free inference based on the Tukey estimator a consistent scale estimator is required. However, standard robust scale estimators such as the interquartile range and the median absolute deviation are inconsistent under contamination.
