Self-Supervised Dynamical System Representations for Physiological Time-Series
Yenho Chen, Maxwell A. Xu, James M. Rehg, Christopher J. Rozell
TL;DR
Physiological time-series SSL often conflates underlying system dynamics with sample-specific noise. The authors propose PULSE, a cross-reconstruction objective grounded in a dynamical-systems generative model to recover shared system parameters while discarding sample-specific factors, with theoretical conditions for successful recovery. Synthetic experiments on chaotic systems validate the approach, and real-world datasets demonstrate improved linear probe performance, label efficiency, and transferability across domains. By explicitly prioritizing system information, PULSE achieves robust representations that generalize better to downstream tasks than conventional SSL methods.
Abstract
The effectiveness of self-supervised learning (SSL) for physiological time series depends on the ability of a pretraining objective to preserve information about the underlying physiological state while filtering out unrelated noise. However, existing strategies are limited due to reliance on heuristic principles or poorly constrained generative tasks. To address this limitation, we propose a pretraining framework that exploits the information structure of a dynamical systems generative model across multiple time-series. This framework reveals our key insight that class identity can be efficiently captured by extracting information about the generative variables related to the system parameters shared across similar time series samples, while noise unique to individual samples should be discarded. Building on this insight, we propose PULSE, a cross-reconstruction-based pretraining objective for physiological time series datasets that explicitly extracts system information while discarding non-transferrable sample-specific ones. We establish theory that provides sufficient conditions for the system information to be recovered, and empirically validate it using a synthetic dynamical systems experiment. Furthermore, we apply our method to diverse real-world datasets, demonstrating that PULSE learns representations that can broadly distinguish semantic classes, increase label efficiency, and improve transfer learning.
