Kernel Smoothing for Bounded Copula Densities
Mathias N. Muia, Olivia Atutey, Mahmud Hasan
TL;DR
This work develops a two-stage, nonparametric kernel estimator for bivariate copula densities that remains well-behaved near the unit-square boundaries through a mirror-reflection technique. It provides rigorous bias-variance characterizations and proves uniform consistency and asymptotic normality, along with practical bandwidth selection strategies including AMISE-based rule-of-thumb and data-driven cross-validation (LSCV and BCV). Simulation and a real-data application to the Wisconsin Breast Cancer Diagnostic Dataset demonstrate that AMISE-optimal bandwidths perform robustly when margins have unbounded support, while LSCV offers asymptotic optimality and useful bandwidth tuning in practice. The study offers a principled framework for estimating bounded copula densities with boundary corrections, enabling reliable dependence modeling in applications where marginal supports vary in extent and shape.
Abstract
Nonparametric estimation of copula density functions using kernel estimators presents significant challenges. One issue is the potential unboundedness of certain copula density functions at the corners of the unit square. Another is the boundary bias inherent in kernel density estimation. This paper presents a kernel-based method for estimating bounded copula density functions, addressing boundary bias through the mirror-reflection technique. Optimal smoothing parameters are derived via Asymptotic Mean Integrated Squared Error (AMISE) minimization and cross-validation, with theoretical guarantees of consistency and asymptotic normality. Two kernel smoothing strategies are proposed: the rule-of-thumb approach and least squares cross-validation (LSCV). Simulation studies highlight the efficacy of the rule-of-thumb method in bandwidth selection for copulas with unbounded marginal supports. The methodology is further validated through an application to the Wisconsin Breast Cancer Diagnostic Dataset (WBCDD), where LSCV is used for bandwidth selection.
