The Causal Chambers: Real Physical Systems as a Testbed for AI Methodology
Juan L. Gamella, Jonas Peters, Peter Bühlmann
TL;DR
The paper addresses the scarcity of real-ground-truth data for validating AI methods by introducing two automated physical testbeds, the Wind Tunnel and Light Tunnel, termed Causal Chambers. These devices embody well-understood physics and allow controlled interventions to generate large, multi-modal datasets with causal ground-truth graphs that serve as rigorous benchmarks. The authors provide ground-truth causal graphs, diverse data modalities, and open-source hardware/software plus downloadable datasets to support tasks across causal discovery, out-of-distribution generalization, change-point detection, ICA, symbolic regression, and physics-informed ML. Case studies reveal strengths and failure modes of leading algorithms on real-world-like data, highlighting validation utility and open science. By offering an accessible, open platform for validation, the work aims to accelerate robust methodological development and reproducibility in AI.
Abstract
In some fields of AI, machine learning and statistics, the validation of new methods and algorithms is often hindered by the scarcity of suitable real-world datasets. Researchers must often turn to simulated data, which yields limited information about the applicability of the proposed methods to real problems. As a step forward, we have constructed two devices that allow us to quickly and inexpensively produce large datasets from non-trivial but well-understood physical systems. The devices, which we call causal chambers, are computer-controlled laboratories that allow us to manipulate and measure an array of variables from these physical systems, providing a rich testbed for algorithms from a variety of fields. We illustrate potential applications through a series of case studies in fields such as causal discovery, out-of-distribution generalization, change point detection, independent component analysis, and symbolic regression. For applications to causal inference, the chambers allow us to carefully perform interventions. We also provide and empirically validate a causal model of each chamber, which can be used as ground truth for different tasks. All hardware and software is made open source, and the datasets are publicly available at causalchamber.org or through the Python package causalchamber.
