Cardinality-Regularized Hawkes-Granger Model
Tsuyoshi Idé, Georgios Kollias, Dzung T. Phan, Naoki Abe
TL;DR
This work addresses the problem of learning sparse Granger causality in Hawkes processes, where conventional likelihood-based approaches suffer from a pathological singularity that prevents true sparsity. It introduces a cardinality-regularized MM framework, termed L0 Hawkes, that enforces sparsity via an $\\ell_0$ penalty on the triggering matrix $\\mathsf{A}$ and yields semi-analytic instance- and type-level causal diagnoses through instance triggering probabilities $q_{n,i}$. By combining MM with Jensen bounds and an $\\epsilon$-sparsity strategy, the method achieves robust, interpretable sparse graphs while maintaining tractable optimization with complexity $\\mathcal{O}(N^2+D^2)$. The authors validate the approach on synthetic benchmarks and real-world data from power grids and cloud data centers, showing that L0 Hawkes achieves higher break-even accuracy and sparser, more meaningful causal structures than neural Granger or conventional sparse Hawkes models. Overall, the paper provides a principled, scalable framework for precise instance-wise causality in event data, with practical impact for prioritizing and diagnosing failures in complex systems.
Abstract
We propose a new sparse Granger-causal learning framework for temporal event data. We focus on a specific class of point processes called the Hawkes process. We begin by pointing out that most of the existing sparse causal learning algorithms for the Hawkes process suffer from a singularity in maximum likelihood estimation. As a result, their sparse solutions can appear only as numerical artifacts. In this paper, we propose a mathematically well-defined sparse causal learning framework based on a cardinality-regularized Hawkes process, which remedies the pathological issues of existing approaches. We leverage the proposed algorithm for the task of instance-wise causal event analysis, where sparsity plays a critical role. We validate the proposed framework with two real use-cases, one from the power grid and the other from the cloud data center management domain.
