Estimating counterfactual treatment outcomes over time in complex multiagent scenarios
Keisuke Fujii, Koh Takeuchi, Atsushi Kuribayashi, Naoya Takeishi, Yoshinobu Kawahara, Kazuya Takeda
TL;DR
The paper tackles the problem of estimating time-varying individual treatment effects (ITE) in complex, multiagent settings with hidden confounders, proposing a novel Theory-based Graph Variational Counterfactual Recurrent Network (TGV-CRN). The model integrates graph variational RNNs (GVRNN) for local inter-agent dynamics with theory-based global computations to enable long-term, interpretable counterfactual predictions of covariates and outcomes. Key contributions include the development of an interpretable counterfactual recurrent framework, incorporation of domain knowledge through theory-based functions, and empirical validation on synthetic CARLA and Boid data as well as real NBA basketball data, showing improved covariate prediction and more accurate timely interventions. This approach advances multiagent causal inference by providing actionable insights for when and how interventions are effective in real-world complex systems such as autonomous driving, biology, and sports.
Abstract
Evaluation of intervention in a multiagent system, e.g., when humans should intervene in autonomous driving systems and when a player should pass to teammates for a good shot, is challenging in various engineering and scientific fields. Estimating the individual treatment effect (ITE) using counterfactual long-term prediction is practical to evaluate such interventions. However, most of the conventional frameworks did not consider the time-varying complex structure of multiagent relationships and covariate counterfactual prediction. This may lead to erroneous assessments of ITE and difficulty in interpretation. Here we propose an interpretable, counterfactual recurrent network in multiagent systems to estimate the effect of the intervention. Our model leverages graph variational recurrent neural networks and theory-based computation with domain knowledge for the ITE estimation framework based on long-term prediction of multiagent covariates and outcomes, which can confirm the circumstances under which the intervention is effective. On simulated models of an automated vehicle and biological agents with time-varying confounders, we show that our methods achieved lower estimation errors in counterfactual covariates and the most effective treatment timing than the baselines. Furthermore, using real basketball data, our methods performed realistic counterfactual predictions and evaluated the counterfactual passes in shot scenarios.
