Overcoming Dependent Censoring in the Evaluation of Survival Models
Christian Marius Lillelund, Shi-ang Qi, Russell Greiner
TL;DR
The paper tackles bias in survival-model evaluation caused by dependent censoring, where $T$ and $C$ are not independent. It introduces three copula-based metrics, CI-Dep, IBS-Dep, and MAE-Dep, that incorporate Archimedean copulas via the Copula-Graphic framework to account for the dependence between $T$ and $C$, replacing KM-based components with CG-based estimates. A semi-synthetic data-generation framework is also developed to realistically simulate dependent censoring and enable robust benchmarking. Empirical results on synthetic and semi-synthetic data show that the proposed metrics reduce bias and provide more reliable model error estimates under dependent censoring, with performance depending on the strength of dependence and censoring rate.
Abstract
Conventional survival metrics, such as Harrell's concordance index (CI) and the Brier Score, rely on the independent censoring assumption for valid inference with right-censored data. However, in the presence of so-called dependent censoring, where the probability of censoring is related to the event of interest, these metrics can give biased estimates of the underlying model error. In this paper, we introduce three new evaluation metrics for survival analysis based on Archimedean copulas that can account for dependent censoring. We also develop a framework to generate realistic, semi-synthetic datasets with dependent censoring to facilitate the evaluation of the metrics. Our experiments in synthetic and semi-synthetic data demonstrate that the proposed metrics can provide more accurate estimates of the model error than conventional metrics under dependent censoring.
