Adaptive tail index estimation: minimal assumptions and non-asymptotic guarantees

Johannes Lederer; Anne Sabourin; Mahsa Taheri

TL;DR

This work tackles the challenge of selecting the threshold $k$ for tail index inference using the Hill estimator under minimal assumptions. It introduces Extreme Adaptive Validation (EAV), an adaptive rule on a compact grid that relies on a transparent bias-variance decomposition and explicit variance quantiles, avoiding second-order or von Mises calibrations. Theoretical guarantees show that EAV matches the oracle error up to a small factor and achieves near-minimax rates under von Mises conditions, while remaining robust under only regular variation. Empirically, EAV outperforms existing adaptive schemes on ill-behaved tails and remains competitive on well-behaved tails, with substantially reduced computational complexity due to the grid restriction. Overall, the method provides a practical, assumption-light, and scalable approach for adaptive tail estimation with strong non-asymptotic guarantees.

Abstract

A notoriously difficult challenge in extreme value theory is choosing the number $k \ll n$ of extreme data points to use for inference on tail quantities, where $n$ is the total sample size. Existing theoretical guarantees for adaptive methods typically require second-order or von Mises assumptions that are difficult to verify, and they often come with tuning parameters that are challenging to calibrate. This paper revisits the problem of adaptive selection of $k$ for the Hill estimator. Our goal is not an 'optimal' $k$ but one that is 'good enough', in the sense that we strive for non-asymptotic guarantees that might be sub-optimal but are explicit and require minimal conditions. We propose a transparent adaptive rule that does not require preliminary calibration of constants, inspired by 'adaptive validation' developed in high-dimensional statistics. A key feature of our approach is the consideration of a grid for $k$ of size $\ll n$, which aligns with common practice among practitioners but has remained unexplored in theoretical analysis. Our rule only involves an explicit expression of a variance-type term; in particular, it does not require controlling or estimating a bias term. Our theoretical analysis is valid for all heavy-tailed distributions, specifically for all regularly varying survival functions. Furthermore, when von Mises conditions hold, our method achieves 'almost' minimax optimality with a rate of $\sqrt{\log \log n}\, n^{-|ρ|/(1+2|ρ|)}$ when the grid size is of order $\log n$, in contrast to the $(\log \log (n)/n)^{|ρ|/(1+2|ρ|)}$ rate in existing work. Our simulations show that our approach performs particularly well for ill-behaved distributions.
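To make the objects in the abstract concrete, the following minimal sketch computes the standard Hill estimator $\hat\gamma(k) = \frac{1}{k}\sum_{i=1}^{k} \log\big(X_{(i)}/X_{(k+1)}\big)$, where $X_{(1)} \ge \dots \ge X_{(n)}$ are the order statistics, over a grid of $k$ values whose size is of order $\log n$. It does not implement the paper's selection rule. The function names `hill_estimator` and `geometric_grid`, the choice of a base-2 geometric grid, and the Pareto test sample are assumptions made for illustration only.

```python
import math
import random


def hill_estimator(sample, k):
    """Hill estimate of gamma = 1/alpha from the k largest observations."""
    x = sorted(sample, reverse=True)  # x[0] is the largest observation
    n = len(x)
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < n")
    threshold = x[k]  # the (k+1)-th largest observation
    if threshold <= 0:
        raise ValueError("the threshold must be positive to take logarithms")
    return sum(math.log(xi / threshold) for xi in x[:k]) / k


def geometric_grid(n, base=2):
    """Hypothetical grid of k values (powers of `base` below n), of size O(log n)."""
    grid = []
    k = base
    while k < n:
        grid.append(k)
        k *= base
    return grid


# Illustration on an exact Pareto sample with tail index alpha = 2 (gamma = 0.5).
random.seed(0)
sample = [random.paretovariate(2.0) for _ in range(10_000)]
estimates = {k: hill_estimator(sample, k) for k in geometric_grid(len(sample))}

for k, gamma_hat in estimates.items():
    print(f"k = {k:5d}   Hill estimate = {gamma_hat:.3f}")
```

For an exact Pareto sample the Hill estimator has no bias, so the estimates simply stabilise as $k$ grows. For distributions that are only regularly varying, large $k$ introduces bias, which is the trade-off an adaptive choice of $k$ on such a grid has to resolve.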
