Nonparametric hazard rate estimation with associated kernels and minimax bandwidth choice
Luce Breuil, Sarah Kaakaï
TL;DR
The paper develops a unified nonparametric hazard-rate estimation framework using associated kernels whose shapes depend on the estimation point, providing a second-order MISE expansion and an asymptotic normality result. It extends the Goldenshluger–Lepski oracle inequality to both pointwise and global minimax bandwidth selection for hazard-rate estimation, addressing challenges from unbounded kernel supports. The Gamma kernel is shown to satisfy the proposed assumptions, with explicit results for MISE and asymptotic distribution, and numerical experiments on simulated and drosophila aging data illustrate boundary-bias improvements and the practical value of minimax bandwidths. Overall, the work offers theoretical guarantees and practical tools for hazard-rate estimation with flexible, boundary-aware kernels and data-driven bandwidth selection, paving the way for designing new associated kernels in survival analysis.
Abstract
In this paper, we introduce a general theoretical framework for nonparametric hazard rate estimation using associated kernels, whose shapes depend on the point of estimation. Within this framework, we establish rigorous asymptotic results, including a second-order expansion of the MISE, and a central limit theorem for the proposed estimator. We also prove a new oracle-type inequality for both local and global minimax bandwidth selection, extending the Goldenshluger-Lepski method to the context of associated kernels. Our results propose a systematic way to construct and analyze new associated kernels. Finally, we show that the general framework applies to the Gamma kernel, and we provide several examples of applications on simulated data and experimental data for the study of aging.
