HACSurv: A Hierarchical Copula-Based Approach for Survival Analysis with Dependent Competing Risks
Xin Liu, Weijia Zhang, Min-Ling Zhang
TL;DR
This work tackles survival analysis under dependent competing risks and informative censoring by learning Hierarchical Archimedean Copulas (HACs) to flexibly model asymmetric dependencies. HACSurv combines HAC structure learning with MONDE-based marginal survival estimation and derives a CIF-based prediction that accounts for dependence via $F_{k^*}(t^*) = 1 - \frac{C(\{u_i\})}{C(\{u_i\}_{i \neq k^*})}$, where $u_i = S_{T_i|X}(t^*|x^*)$. The authors show that jointly learning the HAC and marginals reduces bias in survival distributions and achieves state-of-the-art performance on synthetic and real-world datasets, while also offering insights into disease interdependencies. The method supports end-to-end training, flexible structure discovery, and even allows specifying HACs when prior knowledge is available; code is provided for reproducibility.
Abstract
In survival analysis, subjects often face competing risks; for example, individuals with cancer may also suffer from heart disease or other illnesses, which can jointly influence the prognosis of risks and censoring. Traditional survival analysis methods often treat competing risks as independent and fail to accommodate the dependencies between different conditions. In this paper, we introduce HACSurv, a survival analysis method that learns Hierarchical Archimedean Copulas structures and cause-specific survival functions from data with competing risks. HACSurv employs a flexible dependency structure using hierarchical Archimedean copulas to represent the relationships between competing risks and censoring. By capturing the dependencies between risks and censoring, HACSurv improves the accuracy of survival predictions and offers insights into risk interactions. Experiments on synthetic dataset demonstrate that our method can accurately identify the complex dependency structure and precisely predict survival distributions, whereas the compared methods exhibit significant deviations between their predictions and the true distributions. Experiments on multiple real-world datasets also demonstrate that our method achieves better survival prediction compared to previous state-of-the-art methods.
