Predictive Coding beyond Correlations
Tommaso Salvatori, Luca Pinchetti, Amine M'Charrak, Beren Millidge, Thomas Lukasiewicz
TL;DR
The paper investigates extending highly influential predictive coding (PC) networks beyond correlation-based inference to full causal reasoning. It shows how PC graphs can perform interventions without mutilating the graph by manipulating prediction errors, effectively implementing do-operations at runtime, and extends this to learn causal structures from observational data via continuous adjacency weights with acyclicity and sparsity priors. The authors provide theoretical links to Structural Causal Models and validate them through extensive experiments on synthetic DAGs and image classification tasks, demonstrating associational, interventional, and counterfactual reasoning, with improvements in MNIST/FashionMNIST when using interventional queries. Overall, this work bridges computational neuroscience and causality, proposing an end-to-end, transparent causal engine that jointly learns graph structure and performs causal queries, while acknowledging limitations related to Markov equivalence and data constraints.
Abstract
Recently, there has been extensive research on the capabilities of biologically plausible algorithms. In this work, we show how one of such algorithms, called predictive coding, is able to perform causal inference tasks. First, we show how a simple change in the inference process of predictive coding enables to compute interventions without the need to mutilate or redefine a causal graph. Then, we explore applications in cases where the graph is unknown, and has to be inferred from observational data. Empirically, we show how such findings can be used to improve the performance of predictive coding in image classification tasks, and conclude that such models are able to perform simple end-to-end causal inference tasks.
