Robust and efficient estimation for the Generalized Extreme-Value distribution with application to flood frequency analysis in the UK
Nathan Huet, Ilaria Prosdocimi
TL;DR
The paper develops a robust estimation framework for the three-parameter Generalized Extreme-Value (GEV) distribution using the Minimum Density Power Divergence (MDPD). By minimizing the H_α criterion, the authors achieve estimators that interpolate between the efficient maximum likelihood estimator (α=0) and robust L^2-type methods (α>0), and prove consistency, asymptotic normality, and bounded influence for α>0. Simulation studies show that MDPD maintains competitive efficiency in uncontaminated settings while delivering superior robustness under contamination, with the trade-off controlled by α. A UK flood-frequency case study demonstrates the method’s practical advantage by downweighting influential low floods without discarding data, yielding distributional fits and tail inferences closer to PILF-removed references. Overall, the work offers a theoretically sound and practically useful alternative for extreme-value analysis in environmental settings with contaminated data.
Abstract
A common approach for modeling extremes, such as peak flow or high temperatures, is the three-parameter Generalized Extreme-Value distribution. This is typically fit to extreme observations, here defined as maxima over disjoint blocks. This results in limited sample sizes and consequently, the use of classic estimators, such as the maximum likelihood estimator, may be inappropriate, as they are highly sensitive to outliers. To address these limitations, we propose a novel robust estimator based on the minimization of the density power divergence, controlled by a tuning parameter $α$ that balances robustness and efficiency. When $α= 0$, our estimator coincides with the maximum likelihood estimator; when $α= 1$, it corresponds to the $L^2$ estimator, known for its robustness. We establish convenient theoretical properties of the proposed estimator, including its asymptotic normality and the boundedness of its influence function for $α> 0$. The practical efficiency of the method is demonstrated through empirical comparisons with the maximum likelihood estimator and other robust alternatives. Finally, we illustrate its relevance in a case study on flood frequency analysis in the UK and provide some general conclusions in Section 6.
