Survival Analysis with Graph-Based Regularization for Predictors
Liyan Xie, Xi He, Pinar Keskinocak, Yao Xie
TL;DR
This work introduces a graph-regularized maximum partial likelihood approach for Cox survival models to address high-dimensional, correlated predictors. By encoding predictor relationships in a graph G and employing the norm $\|\boldsymbol{\beta}\|_{G,\bm{\tau}}$, the method jointly selects groups of related variables and improves prediction via a group-lasso–type formulation achieved through predictor duplication. The authors establish finite-sample recovery guarantees and asymptotic normality, and demonstrate through simulations and real organ transplantation data that incorporating graph structure yields better estimation accuracy and higher concordance (c-index) than classic regularizers. The approach is applicable beyond organ transplantation and provides a scalable framework for graph-informed variable selection in survival analysis, with clear paths for extending to weighted or donor–recipient networks. Practical impact includes more reliable identification of survival-related factors and improved risk prediction in settings with correlated predictors and right-censored data.
Abstract
We study the variable selection problem in survival analysis to identify the most important factors affecting survival time. Our method incorporates prior knowledge of mutual correlations among variables, represented through a graph. We utilize the Cox proportional hazard model with a graph-based regularizer for variable selection. We present a computationally efficient algorithm developed to solve the graph regularized maximum likelihood problem by establishing connections with the group lasso, and provide theoretical guarantees about the recovery error and asymptotic distribution of the proposed estimators. The improved performance of the proposed approach compared with existing methods are demonstrated in both synthetic and real organ transplantation datasets.
