Causal Reasoning from Meta-reinforcement Learning
Ishita Dasgupta, Jane Wang, Silvia Chiappa, Jovana Mitrovic, Pedro Ortega, David Raposo, Edward Hughes, Peter Battaglia, Matthew Botvinick, Zeb Kurth-Nelson
TL;DR
This paper investigates whether causal reasoning can emerge from meta-reinforcement learning by training an RNN-based agent with model-free RL on tasks implemented as causal Bayesian networks. The approach enables an inner learning procedure that infers causal structure from data and performs interventions, observations, and counterfactuals in three data settings (observational, interventional, counterfactual). Across these settings, the agents demonstrate do-calculus-like reasoning, resolve unobserved confounders via interventions, and perform counterfactual predictions, all while learning to actively design informative experiments. The results show that end-to-end meta-learning can yield robust causal reasoning in complex, unseen causal graphs, with potential benefits for structured exploration and scalable, real-time reasoning in reinforcement learning contexts.
Abstract
Discovering and exploiting the causal structure in the environment is a crucial challenge for intelligent agents. Here we explore whether causal reasoning can emerge via meta-reinforcement learning. We train a recurrent network with model-free reinforcement learning to solve a range of problems that each contain causal structure. We find that the trained agent can perform causal reasoning in novel situations in order to obtain rewards. The agent can select informative interventions, draw causal inferences from observational data, and make counterfactual predictions. Although established formal causal reasoning algorithms also exist, in this paper we show that such reasoning can arise from model-free reinforcement learning, and suggest that causal reasoning in complex settings may benefit from the more end-to-end learning-based approaches presented here. This work also offers new strategies for structured exploration in reinforcement learning, by providing agents with the ability to perform -- and interpret -- experiments.
