Adaptive Transformer Modelling of Density Function for Nonparametric Survival Analysis
Xin Zhang, Deval Mehta, Yanan Hu, Chao Zhu, David Darby, Zhen Yu, Daniel Merlo, Melissa Gresle, Anneke Van Der Walt, Helmut Butzkueven, Zongyuan Ge
TL;DR
This paper introduces UniSurv, a nonparametric survival model based on Transformer architecture that jointly handles static and dynamic data, missing values, and censoring while producing unimodal, calibrated PDFs. It leverages a novel Margin-Mean-Variance loss to guide the learned distribution, improves censoring prediction, and supports both static and longitudinal inputs through dedicated extraction branches. Comprehensive experiments on static and dynamic datasets show UniSurv achieves superior or competitive C-index, MAE-H, MAE-U, and mAUC, with robust PDF visualizations and insightful ablations. The approach offers a flexible, end-to-end framework for time-to-event prediction with practical implications for healthcare and other domains requiring reliable survival analysis.
Abstract
Survival analysis holds a crucial role across diverse disciplines, such as economics, engineering and healthcare. It empowers researchers to analyze both time-invariant and time-varying data, encompassing phenomena like customer churn, material degradation and various medical outcomes. Given the complexity and heterogeneity of such data, recent endeavors have demonstrated successful integration of deep learning methodologies to address limitations in conventional statistical approaches. However, current methods typically involve cluttered probability distribution function (PDF), have lower sensitivity in censoring prediction, only model static datasets, or only rely on recurrent neural networks for dynamic modelling. In this paper, we propose a novel survival regression method capable of producing high-quality unimodal PDFs without any prior distribution assumption, by optimizing novel Margin-Mean-Variance loss and leveraging the flexibility of Transformer to handle both temporal and non-temporal data, coined UniSurv. Extensive experiments on several datasets demonstrate that UniSurv places a significantly higher emphasis on censoring compared to other methods.
