ARMAX identification of low rank graphical models
Wenqi Cao, Aming Li
TL;DR
This paper tackles the problem of identifying low-rank stochastic processes from noisy measurements by exploiting a sparse-plus-low-rank (S+L) structure in latent-variable graphical models. It proposes a two-stage framework: (i) use maximum-entropy covariance extension to estimate a time-series model for the low-rank latent variable $y_l$, and (ii) perform a maximum-likelihood ARMAX graphical estimation to recover the deterministic relation $H(z)$ linking $y_l$ to the observed data, with identifiability and consistency proven under suitable conditions. The approach yields a consistent ML estimator and an accurate denoising filter, demonstrated via simulations where it significantly outperforms traditional low-rank identification that ignores measurement noise. The results have practical implications for extracting meaningful, reduced-dimension structure from high-dimensional noisy systems and for identifying key latent factors directly from data.
Abstract
In large-scale systems, complex internal relationships are often present. Such interconnected systems can be effectively described by low rank stochastic processes. When identifying a predictive model of low rank processes from sampling data, the rank-deficient property of spectral densities is often obscured by the inevitable measurement noise in practice. However, existing low rank identification approaches often did not take noise into explicit consideration, leading to non-negligible inaccuracies even under weak noise. In this paper, we address the identification issue of low rank processes under measurement noise. We find that the noisy measurement model admits a sparse plus low rank structure in latent-variable graphical models. Specifically, we first decompose the problem into a maximum entropy covariance extension problem, and a low rank graphical estimation problem based on an autoregressive moving-average with exogenous input (ARMAX) model. To identify the ARMAX low rank graphical models, we propose an estimation approach based on maximum likelihood. The identifiability and consistency of this approach are proven under certain conditions. Simulation results confirm the reliable performance of the entire algorithm in both the parameter estimation and noisy data filtering.
