A Flexible Modeling of Extremes in the Presence of Inliers
Shivshankar Nila, Ishapathik Das, N. Balakrishna
TL;DR
This work tackles extreme-value analysis for data with a mass at zero and a nontrivial tail by introducing FEVM IMM, a three-component mixture that simultaneously models inliers at zero, a bulk below a threshold, and a GPD tail above the threshold with the tail fraction as a parameter. It develops a complete-likelihood ML framework, derives the asymptotic distribution of the estimators, and provides explicit score functions for cases with nonzero and zero shape parameter $\xi$, while tying the threshold $u$ into the estimation process. Through extensive simulations and real-data applications, FEVM IMM yields reduced bias and MSE in key extreme-value parameters, improves threshold and tail estimation, and delivers better goodness-of-fit and risk measures compared with EVMM and FEVMM. The framework has practical impact for reliability, environmental, and epidemiological risk assessment and opens avenues for extensions such as multimodal bulk components, change-point models, and Bayesian estimation.
Abstract
Many random phenomena, including life-testing and environmental data, show positive values and excess zeros, which pose modeling challenges. In life testing, immediate failures result in zero lifetimes, often due to defects or poor quality, especially in electronics and clinical trials. These failures, called inliers at zero, are difficult to model using standard approaches. The presence and proportion of inliers may influence the accuracy of extreme value analysis, bias parameter estimates, or even lead to severe events or extreme effects, such as drought or crop failure. In such scenarios, a key issue in extreme value analysis is determining a suitable threshold to capture tail behaviour accurately. Although some extreme value mixture models address threshold and tail estimation, they often inadequately handle inliers, resulting in suboptimal results. Bulk model misspecification can affect the threshold, extreme value estimates, and, in particular, the tail proportion. There is no unified framework for defining extreme value mixture models, especially the tail proportion. This paper proposes a flexible model that handles extremes, inliers, and the tail proportion. Parameters are estimated using maximum likelihood estimation. Compared the proposed model estimates with the classical mean excess plot, parameter stability plot, and Pickands plot estimates. Theoretical results are established, and the proposed model outperforms traditional methods in both simulation studies and real data analysis.
