Conformalized Survival Distributions: A Generic Post-Process to Increase Calibration
Shi-ang Qi, Yakun Yu, Russell Greiner
TL;DR
This paper tackles the persistence of calibration gaps in survival analysis without sacrificing discriminative accuracy. It introduces Conformalized Survival Distribution (CSD), a model-agnostic post-processing framework that uses conformal regression on discretized percentile times to recalibrate survival distributions while preserving discrimination. The authors prove theoretical guarantees—including distribution calibration and KM calibration—and validate the method on 11 real-world datasets, showing robust calibration gains with minimal or no loss in C-index. They also compare CSD to objective-based calibration approaches, demonstrate the benefits of KM-sampling for censoring, and provide practical guidance and code for practitioners. The work offers a scalable, theoretically grounded approach to producing reliable, calibrated survival distributions in the presence of censoring, with broad applicability in clinical decision making and resource allocation.
Abstract
Discrimination and calibration represent two important properties of survival analysis, with the former assessing the model's ability to accurately rank subjects and the latter evaluating the alignment of predicted outcomes with actual events. With their distinct nature, it is hard for survival models to simultaneously optimize both of them especially as many previous results found improving calibration tends to diminish discrimination performance. This paper introduces a novel approach utilizing conformal regression that can improve a model's calibration without degrading discrimination. We provide theoretical guarantees for the above claim, and rigorously validate the efficiency of our approach across 11 real-world datasets, showcasing its practical applicability and robustness in diverse scenarios.
