Table of Contents
Fetching ...

Extending Cox Proportional Hazards Model with Symbolic Non-Linear Log-Risk Functions for Survival Analysis

Jiaxiang Cheng, Guoqiang Hu

TL;DR

This work tackles the limitation of the Cox proportional hazards model's linear log-risk by introducing Generalized Cox Proportional Hazards (GCPH), which leverages Kolmogorov-Arnold Networks to learn a fully symbolic non-linear log-risk function $f(\mathbf{x}; \bm{\Phi}) = \sum_{v=1}^V \hat{\phi}_v(x_v)$. The method integrates a log-partial likelihood with a sparsity- and entropy-based regularization, and applies a post-training symbolification step to yield human-interpretable symbolic activations for each covariate. Experiments on synthetic and real-world survival datasets show that GCPH achieves competitive C-index and Brier scores while providing transparent, covariate-specific risk mappings, enhancing interpretability without sacrificing performance. This approach offers practical impact for domains requiring both accurate survival predictions and understandable risk explanations, addressing a key gap between deep non-linear models and traditional, interpretable survival analysis.

Abstract

The Cox proportional hazards (CPH) model has been widely applied in survival analysis to estimate relative risks across different subjects given multiple covariates. Traditional CPH models rely on a linear combination of covariates weighted with coefficients as the log-risk function, which imposes a strong and restrictive assumption, limiting generalization. Recent deep learning methods enable non-linear log-risk functions. However, they often lack interpretability due to the end-to-end training mechanisms. The implementation of Kolmogorov-Arnold Networks (KAN) offers new possibilities for extending the CPH model with fully transparent and symbolic non-linear log-risk functions. In this paper, we introduce Generalized Cox Proportional Hazards (GCPH) model, a novel method for survival analysis that leverages KAN to enable a non-linear mapping from covariates to survival outcomes in a fully symbolic manner. GCPH maintains the interpretability of traditional CPH models while allowing for the estimation of non-linear log-risk functions. Experiments conducted on both synthetic data and various public benchmarks demonstrate that GCPH achieves competitive performance in terms of prediction accuracy and exhibits superior interpretability compared to current state-of-the-art methods.

Extending Cox Proportional Hazards Model with Symbolic Non-Linear Log-Risk Functions for Survival Analysis

TL;DR

This work tackles the limitation of the Cox proportional hazards model's linear log-risk by introducing Generalized Cox Proportional Hazards (GCPH), which leverages Kolmogorov-Arnold Networks to learn a fully symbolic non-linear log-risk function . The method integrates a log-partial likelihood with a sparsity- and entropy-based regularization, and applies a post-training symbolification step to yield human-interpretable symbolic activations for each covariate. Experiments on synthetic and real-world survival datasets show that GCPH achieves competitive C-index and Brier scores while providing transparent, covariate-specific risk mappings, enhancing interpretability without sacrificing performance. This approach offers practical impact for domains requiring both accurate survival predictions and understandable risk explanations, addressing a key gap between deep non-linear models and traditional, interpretable survival analysis.

Abstract

The Cox proportional hazards (CPH) model has been widely applied in survival analysis to estimate relative risks across different subjects given multiple covariates. Traditional CPH models rely on a linear combination of covariates weighted with coefficients as the log-risk function, which imposes a strong and restrictive assumption, limiting generalization. Recent deep learning methods enable non-linear log-risk functions. However, they often lack interpretability due to the end-to-end training mechanisms. The implementation of Kolmogorov-Arnold Networks (KAN) offers new possibilities for extending the CPH model with fully transparent and symbolic non-linear log-risk functions. In this paper, we introduce Generalized Cox Proportional Hazards (GCPH) model, a novel method for survival analysis that leverages KAN to enable a non-linear mapping from covariates to survival outcomes in a fully symbolic manner. GCPH maintains the interpretability of traditional CPH models while allowing for the estimation of non-linear log-risk functions. Experiments conducted on both synthetic data and various public benchmarks demonstrate that GCPH achieves competitive performance in terms of prediction accuracy and exhibits superior interpretability compared to current state-of-the-art methods.

Paper Structure

This paper contains 30 sections, 18 equations, 8 figures, 2 tables.

Figures (8)

  • Figure 1: Comparison of learning abilities from non-linear relationships. The experiments are based on synthetic non-linear data, illustrated in (a). The models compared are (b) traditional CPH cox_regression_1972, (c) DCPH katzman_deepsurv_2018, and (d) our proposed model. This demonstrates the enhanced capability of proposed model in capturing non-linear relationships compared to existing methods.
  • Figure 2: Overview of proposed GCPH for survival analysis by extending CPH model with non-linear symbolic log-risk function. A typical survival dataset is presented and used for training the GCPH with a specialized loss function, including the log-partial likelihood function. Confidence interval is also illustrated for each symbolic activation function through multiple tests with different random seeds for initialization.
  • Figure 3: Box plots of C-Index evaluated on the prediction results by different models, on various datasets including (a) synthetic linear data, (b) synthetic non-linear data, (c) TRACE, (d) COLON, (e) RDATA, and (f) FRTCS.
  • Figure 4: Comparison of learning abilities from linear relationships. The experiments are based on synthetic non-linear data, illustrated in (a). The models compared are (b) traditional CPH cox_regression_1972, (c) DCPH katzman_deepsurv_2018, and (d) our proposed GCPH model.
  • Figure 5: Symbolic functions learned by GCPH ($a$-$b$) and GCPH-$l$ ($c$-$d$) compared to ground truth in the linear experiments. $\phi_1(x)=x$ and $\phi_2(x)=2x$, while $\hat{\phi}_1$ and $\hat{\phi}_2$ are the predicted ones with multiple runs.
  • ...and 3 more figures