Competing-risk Weibull survival model with multiple causes
Kai Wang, Yuqin Mu, Shenyi Zhang, Zhengjun Zhang, Chengxiu Ling
TL;DR
The paper addresses survival data with multiple contributing causes by introducing a competing-Weibull, min-structured model that yields time-varying, data-driven winning probabilities for each cause. It develops an EM-based maximum likelihood estimation framework with group-wise updates and sparsity penalties, and proves consistency and asymptotic normality under standard regularity conditions. Key contributions include the latent time representation, the explicit winning-probability formula, and the ability to identify informative biomarker combinations while handling high-dimensional covariates. The Alzheimer’s disease application demonstrates improved short-term prediction and interpretable biomarker ordering, highlighting the method’s potential for cause-mixture analysis and biomarker discovery in aging and neurodegenerative disease contexts.
Abstract
The failure of a system can result from the simultaneous effects of multiple causes, where assigning a specific cause may be inappropriate or unavailable. Examples include contributing causes of death in epidemiology and the aetiology of neurodegenerative diseases like Alzheimer's. We propose a parametric Weibull accelerated failure time model for multiple causes, incorporating a data-driven, individualized, and time-varying winning probability (relative importance) matrix. Using maximum likelihood estimation and the expectation-maximization (EM) algorithm, our approach enables simultaneous estimation of regression coefficients and relative cause importance, ensuring consistency and asymptotic normality. A simulation study and an application to Alzheimer's disease demonstrate its effectiveness in addressing cause-mixture problems and identifying informative biomarker combinations, with comparisons to Weibull and Cox proportional hazards models.
