High-Dimensional Markov-switching Ordinary Differential Processes
Katherine Tsai, Mladen Kolar, Sanmi Koyejo
TL;DR
This work tackles parameter recovery for high-dimensional Markov-switching ODEs with nonlinear additive dynamics from discretely observed data. It introduces a two-stage estimation approach that first reconstructs the continuous trajectory via wavelet smoothing and then applies a (truncated) EM algorithm to infer the transition-rate matrix $Q$ and additive-function coefficients, with theoretical guarantees under $β$-mixing. The authors derive convergence and error bounds for both the trajectory recovery and the EM procedure, and demonstrate graph-recovery performance and edge detection under realistic sample sizes. The method is validated on simulated data and applied to resting-state ADHD fMRI, where group differences in transition dynamics and state-specific connectomes are revealed, highlighting the approach’s practical impact for understanding time-varying brain networks.
Abstract
We investigate the parameter recovery of Markov-switching ordinary differential processes from discrete observations, where the differential equations are nonlinear additive models. This framework has been widely applied in biological systems, control systems, and other domains; however, limited research has been conducted on reconstructing the generating processes from observations. In contrast, many physical systems, such as human brains, cannot be directly experimented upon and rely on observations to infer the underlying systems. To address this gap, this manuscript presents a comprehensive study of the model, encompassing algorithm design, optimization guarantees, and quantification of statistical errors. Specifically, we develop a two-stage algorithm that first recovers the continuous sample path from discrete samples and then estimates the parameters of the processes. We provide novel theoretical insights into the statistical error and linear convergence guarantee when the processes are $β$-mixing. Our analysis is based on the truncation of the latent posterior processes and demonstrates that the truncated processes approximate the true processes under mixing conditions. We apply this model to investigate the differences in resting-state brain networks between the ADHD group and normal controls, revealing differences in the transition rate matrices of the two groups.
