Deep Kernel Aalen-Johansen Estimator: An Interpretable and Flexible Neural Net Framework for Competing Risks
Xiaobin Shen, George H. Chen
TL;DR
This work introduces the Deep Kernel Aalen-Johansen (DKAJ) estimator, a unified framework that delivers interpretable, kernel-based hazard modeling for competing risks by representing each data point as a weighted mixture of cluster exemplars whose AJ-estimated CIFs are stored per cluster. By learning a neural embedding and using a kernel to weight training points, DKAJ yields a conditional AJ-like CIF prediction for new individuals while enabling straightforward interpretability through cluster-level visualizations and exemplar contributions. The approach achieves competitive performance against state-of-the-art baselines on four standard datasets and provides practical visualization tools for understanding both group-level and individual-level risk trajectories. The paper also establishes connections to classical AJ theory via a likelihood-based interpretation and discusses extensions (kernel choices, clustering schemes, and multistate processes) that preserve interpretability while expanding flexibility.
Abstract
We propose an interpretable deep competing risks model called the Deep Kernel Aalen-Johansen (DKAJ) estimator, which generalizes the classical Aalen-Johansen nonparametric estimate of cumulative incidence functions (CIFs). Each data point (e.g., patient) is represented as a weighted combination of clusters. If a data point has nonzero weight only for one cluster, then its predicted CIFs correspond to those of the classical Aalen-Johansen estimator restricted to data points from that cluster. These weights come from an automatically learned kernel function that measures how similar any two data points are. On four standard competing risks datasets, we show that DKAJ is competitive with state-of-the-art baselines while being able to provide visualizations to assist model interpretation.
