Covariate-Adjusted Deep Causal Learning for Heterogeneous Panel Data Models
Guanhao Zhou, Yuefeng Han, Xiufan Yu
TL;DR
This work tackles unit-specific heterogeneous treatment effects in causal panel data with covariate influences and non-random missingness. It introduces Covariate-Adjusted Deep Causal Learning (CoDEAL), combining a nonlinear covariate adjustment via a deep neural network with a nonlinear latent-factor structure learned by a multi-output autoencoder to perform causal matrix completion under staggered adoption designs. The paper provides theoretical convergence guarantees for the estimated counterfactuals, and demonstrates through extensive simulations that CoDEAL consistently outperforms benchmark methods across linear and nonlinear factor and covariate settings. An empirical application to OxCGRT data on government interventions illustrates CoDEAL’s practical impact on counterfactual analysis in real-world policy contexts. The framework offers a scalable, unified approach to covariate-adjusted heterogeneous causal inference in panel data and can be extended to tensor data and multiple treatments in future work.
Abstract
This paper studies the task of estimating heterogeneous treatment effects in causal panel data models, in the presence of covariate effects. We propose a novel Covariate-Adjusted Deep Causal Learning (CoDEAL) for panel data models, that employs flexible model structures and powerful neural network architectures to cohesively deal with the underlying heterogeneity and nonlinearity of both panel units and covariate effects. The proposed CoDEAL integrates nonlinear covariate effect components (parameterized by a feed-forward neural network) with nonlinear factor structures (modeled by a multi-output autoencoder) to form a heterogeneous causal panel model. The nonlinear covariate component offers a flexible framework for capturing the complex influences of covariates on outcomes. The nonlinear factor analysis enables CoDEAL to effectively capture both cross-sectional and temporal dependencies inherent in the data panel. This latent structural information is subsequently integrated into a customized matrix completion algorithm, thereby facilitating more accurate imputation of missing counterfactual outcomes. Moreover, the use of a multi-output autoencoder explicitly accounts for heterogeneity across units and enhances the model interpretability of the latent factors. We establish theoretical guarantees on the convergence of the estimated counterfactuals, and demonstrate the compelling performance of the proposed method using extensive simulation studies and a real data application.
