Recurrences reveal shared causal drivers of complex time series
William Gilpin
TL;DR
SHREC reframes driver reconstruction as a dynamics-inspired problem: an unobserved driver shapes multiple responses whose recurrences encode the driver’s states. It builds a consensus recurrence graph from adaptive, fuzzy proximity embeddings and traverses it via diffusion or Leiden-based community detection to recover a time-resolved driver. The method achieves strong, data-efficient performance across diverse domains, reveals higher-order interactions, and links reconstruction accuracy to percolation on the recurrence graph and to the dominant unstable periodic orbits of the driver. These results establish a physics-based, unsupervised approach for extracting shared causal forces from noisy observational data with broad applicability in biology, ecology, and engineering.
Abstract
Unmeasured causal forces influence diverse experimental time series, such as the transcription factors that regulate genes, or the descending neurons that steer motor circuits. Combining the theory of skew-product dynamical systems with topological data analysis, we show that simultaneous recurrence events across multiple time series reveal the structure of their shared unobserved driving signal. We introduce a physics-based unsupervised learning algorithm that reconstructs causal drivers by iteratively building a recurrence graph with glass-like structure. As the amount of data increases, a percolation transition on this graph leads to weak ergodicity breaking for random walks -- revealing the shared driver's dynamics, even from strongly-corrupted measurements. We relate reconstruction accuracy to the rate of information transfer from a chaotic driver to the response systems, and we find that effective reconstruction proceeds through gradual approximation of the driver's dynamical attractor. Through extensive benchmarks against classical signal processing and machine learning techniques, we demonstrate our method's ability to extract causal drivers from diverse experimental datasets spanning ecology, genomics, fluid dynamics, and physiology.
