Modeling Extreme Events in the Presence of Inlier: A Mixture Approach
Shivshankar Nila, Ishapathik Das, N. Balakrishna
TL;DR
The paper tackles extreme-value analysis when inliers occur at zero by introducing the Extreme Value Inlier Mixture Model (EVIMM), which combines a degenerate mass at zero, a gamma bulk below a threshold $u$, and a generalized Pareto tail above $u$. Threshold uncertainty is incorporated by treating $u$ as an estimable parameter within a maximum-likelihood framework. The authors provide an explicit data-generation algorithm, derive the MLEs, and validate the approach through extensive simulations and real-data applications, showing EVIMM reduces bias and MSE and improves return-level estimates relative to EVMM, which neglects inliers. The work offers practical gains for reliability, environmental, and risk-management data where zero-inliers and extremes co-occur, and suggests future extensions to broader inlier patterns and Bayesian estimation.
Abstract
In many random phenomena, such as life-testing experiments and environmental data (like rainfall data), there are often positive values and an excess of zeros, which create modeling challenges. In life testing, immediate failures result in zero lifetimes, often due to defects or poor quality, especially in electronics and clinical trials. These failures, called zero inliers, are difficult to model using standard approaches. When studying extreme values in the above scenarios, a key issue is selecting an appropriate threshold for accurate tail approximation of the population using asymptotic models. While some extreme value mixture models address threshold estimation and tail approximation, conventional parametric and non-parametric bulk and generalised Pareto distribution (GPD) approaches often neglect inliers, leading to suboptimal results. This paper introduces a framework for modeling extreme events and inliers using the GPD, addressing threshold uncertainty and effectively capturing inliers at zero. The model's parameters are estimated using the maximum likelihood estimation (MLE) method, ensuring optimal precision. Through simulation studies and real-world applications, we demonstrate that the proposed model significantly outperforms the traditional methods, which typically neglect inliers at the origin.
