Spectral clustering for dependent community Hawkes process models of temporal networks
Lingfei Zhao, Hadeel Soliman, Kevin S. Xu, Subhadeep Paul
TL;DR
This work develops a general Dependent Community Hawkes (DCH) framework that integrates stochastic block models with mutually exciting Hawkes processes to capture both community structure and dyadic dependence in temporal networks. It provides non-asymptotic spectral clustering guarantees on the count matrix, revealing how misclustering error scales with $n$, $K$, $T$, and the dependence parameter $\gamma_{\max}$, and shows consistency as $T\to\infty$. To balance flexibility and scalability, the authors introduce the Self and Reciprocal (SR) model within DCH and derive a consistent Generalized Method of Moments (GMM) estimator for its parameters under a restricted SR variant, complemented by a local refinement step to improve community assignments. The approach is validated through extensive simulations and real-data experiments, demonstrating competitive predictive performance and substantial computational efficiency compared to more flexible models like MULCH. Overall, the paper advances spectral clustering theory for dependent, weighted temporal networks and provides practical, scalable tools for joint community detection and Hawkes-parameter estimation.
Abstract
Temporal networks observed continuously over time through timestamped relational event data are commonly encountered in application settings including online social media communications, financial transactions, and international relations. Temporal networks often exhibit community structure and strong dependence patterns among node pairs. This dependence can be modeled through mutual excitation, where an interaction event from a sender to a receiver node increases the probability of future events among other node pairs. We provide statistical results for a class of models that we call dependent community Hawkes (DCH) models, which combine the stochastic block model with mutually exciting Hawkes processes to model community structure and dependence among node pairs, respectively. We derive a non-asymptotic upper bound on the misclustering error of spectral clustering on the event count matrix as a function of the number of nodes, the number of communities, the time duration, and the amount of dependence in the model. Our result leverages recent results on bounding an appropriate distance between a multivariate Hawkes process count vector and a Gaussian vector, along with results from random matrix theory. We also propose a DCH model that incorporates only self and reciprocal excitation, along with highly scalable parameter estimation using a Generalized Method of Moments (GMM) estimator that we demonstrate to be consistent for growing network size and time duration.
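As a rough illustration of spectral clustering on an event count matrix, the following Python sketch takes the leading singular vectors of the count matrix and groups their rows with k-means. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `spectral_cluster_counts`, the concatenation of left and right singular vectors, the farthest-point k-means initialization, and the toy Poisson block counts (a stand-in for counts from a DCH model, with no excitation) are all choices made here for illustration.

```python
import numpy as np


def spectral_cluster_counts(counts, n_communities, n_iter=100):
    """Cluster nodes from an n x n matrix of pairwise event counts.

    Hypothetical helper: entry (i, j) is the number of events sent from
    node i to node j over the observation window.
    """
    counts = np.asarray(counts, dtype=float)
    k = n_communities

    # Leading singular vectors of the (possibly asymmetric) count matrix.
    u, _, vt = np.linalg.svd(counts, full_matrices=False)
    # Assumption: stack left and right vectors so that both sending and
    # receiving patterns inform the clustering.
    features = np.hstack([u[:, :k], vt[:k, :].T])

    # Deterministic farthest-point initialization of the k-means centers.
    centers = [features[0]]
    for _ in range(1, k):
        d = np.min(
            [np.sum((features - c) ** 2, axis=1) for c in centers], axis=0
        )
        centers.append(features[np.argmax(d)])
    centers = np.array(centers)

    # Plain Lloyd iterations.
    for _ in range(n_iter):
        dists = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.array([
            features[labels == j].mean(axis=0) if np.any(labels == j)
            else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels


# Toy data: independent Poisson counts with block-dependent rates.
# This ignores Hawkes excitation entirely and only mimics block structure.
rng = np.random.default_rng(0)
n, k = 40, 2
true_labels = np.repeat(np.arange(k), n // k)
block_rates = np.array([[8.0, 1.0], [1.0, 8.0]])
counts = rng.poisson(block_rates[true_labels][:, true_labels])
np.fill_diagonal(counts, 0)  # no self-events

est_labels = spectral_cluster_counts(counts, k)
```

Estimated labels are identifiable only up to a permutation of community indices, so any comparison with `true_labels` should account for relabeling.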
