Time-Varying Graph Learning for Data with Heavy-Tailed Distribution
Amirhossein Javaheri, Jiaxi Ying, Daniel P. Palomar, Farokh Marvasti
TL;DR
This work tackles learning time-varying graphs under heavy-tailed data and missing entries by introducing a probabilistic framework that combines a non-negative vector autoregressive model for graph weights with a Laplacian-based Student-t signal model. It employs a semi-online MAP approach updating graphs within individual frames and enforces spectral constraints to enable $k$-component graph structures for clustering. An ADMM-based algorithm with majorization-minimization surrogates provides a convergent solution, and the method demonstrates superior performance on synthetic data and real financial data for both clustering and portfolio design. The approach is robust to outliers and missing data and supports near real-time frame-wise updates, making it practically impactful for finance and other dynamic networks.
Abstract
Graph models provide efficient tools to capture the underlying structure of data defined over networks. Many real-world network topologies are subject to change over time. Learning to model the dynamic interactions between entities in such networks is known as time-varying graph learning. Current methodology for learning such models often lacks robustness to outliers in the data and fails to handle heavy-tailed distributions, a common feature in many real-world datasets (e.g., financial data). This paper addresses the problem of learning time-varying graph models capable of efficiently representing heavy-tailed data. Unlike traditional approaches, we incorporate graph structures with specific spectral properties to enhance data clustering in our model. Our proposed method, which can also deal with noise and missing values in the data, is based on a stochastic approach, where a non-negative vector auto-regressive (VAR) model captures the variations in the graph and a Student-t distribution models the signal originating from this underlying time-varying graph. We propose an iterative method to learn time-varying graph topologies within a semi-online framework where only a mini-batch of data is used to update the graph. Simulations with both synthetic and real datasets demonstrate the efficacy of our model in analyzing heavy-tailed data, particularly those found in financial markets.
