Identifying treatment response subgroups in observational time-to-event data
Vincent Jeanselme, Chang Ho Yoon, Fabian Falck, Brian Tom, Jessica Barrett
TL;DR
This work tackles the challenge of identifying patient subgroups with distinct treatment responses in observational time-to-event data. It introduces Causal Survival Clustering (CSC), a neural network–based framework that jointly learns latent subgroups and subgroup-specific survival distributions under treated and untreated conditions, using inverse propensity weighting to adjust for non-random treatment assignment. The approach formalizes Subgroup Average Treatment Effects (SATE) and leverages a monotonic survival network to flexibly model time-to-event outcomes without strong parametric assumptions. Through synthetic experiments and a SEER case study, CSC outperforms state-of-the-art baselines, particularly in observational settings, and provides data-driven guidance for selecting the number of subgroups via an elbow heuristic. The results offer a practical, hypothesis-generating tool to inform clinical trial design and treatment guidelines by revealing heterogeneous treatment responses across real-world patient populations.
Abstract
Identifying patient subgroups with different treatment responses is an important task to inform medical recommendations, guidelines, and the design of future clinical trials. Existing approaches for treatment effect estimation primarily rely on Randomised Controlled Trials (RCTs), which tend to feature more homogeneous patient groups, making them less relevant for uncovering subgroups in the population encountered in real-world clinical practice. Subgroup analyses established for RCTs suffer from significant statistical biases when applied to observational studies, which benefit from larger and more representative populations. Our work introduces a novel, outcome-guided, subgroup analysis strategy for identifying subgroups of treatment response in both RCTs and observational studies alike. It hence positions itself in-between individualised and average treatment effect estimation to uncover patient subgroups with distinct treatment responses, critical for actionable insights that may influence treatment guidelines. In experiments, our approach significantly outperforms the current state-of-the-art method for subgroup analysis in both randomised and observational treatment regimes.
