Causal Customer Churn Analysis with Low-rank Tensor Block Hazard Model
Chenyin Gao, Zhiming Zhang, Shu Yang
TL;DR
The paper tackles causal churn analysis with multiple binary interventions by framing churn trajectories as a three-way tensor $\mathcal{Y}=(Y_{i,t,l})$ and imputing counterfactuals through a 1-bit tensor completion in a tensorized latent factor block hazard model. It combines a low-rank representation of the parameter tensor $\Theta$ with a clustering mechanism on the treatment mode to reduce the intervention space, and it employs IPTW with CBPS-based propensity scores alongside covariate-assisted loading for estimation. The authors establish non-asymptotic error bounds for both parameter estimation and clustering, and they validate the approach via synthetic experiments and a real bank churn dataset, demonstrating superior predictive performance and practical guidance for selecting homogeneous intervention groups. The framework supports identifying individualized optimal interventions to maximize retention and provides a scalable, theoretically-grounded approach to causal churn analysis with binary, time-to-event data.
Abstract
This study introduces an innovative method for analyzing the impact of various interventions on customer churn, using the potential outcomes framework. We present a new causal model, the tensorized latent factor block hazard model, which incorporates tensor completion methods for a principled causal analysis of customer churn. A crucial element of our approach is the formulation of a 1-bit tensor completion for the parameter tensor. This captures hidden customer characteristics and temporal elements from churn records, effectively addressing the binary nature of churn data and its time-monotonic trends. Our model also uniquely categorizes interventions by their similar impacts, enhancing the precision and practicality of implementing customer retention strategies. For computational efficiency, we apply a projected gradient descent algorithm combined with spectral clustering. We lay down the theoretical groundwork for our model, including its non-asymptotic properties. The efficacy and superiority of our model are further validated through comprehensive experiments on both simulated and real-world applications.
