Table of Contents
Fetching ...

Interpretable Transformer Hawkes Processes: Unveiling Complex Interactions in Social Networks

Zizhuo Meng, Ke Wan, Yadong Huang, Zhidong Li, Yang Wang, Feng Zhou

TL;DR

This work tackles the interpretability and expressiveness gap in deep point processes for social networks by introducing interpretable Transformer Hawkes Processes (ITHP). ITHP modifies Transformer Hawkes Process to explicitly capture interactions between event types and to model non-event intervals via a fully attention based intensity, aligning with nonlinear Hawkes kernels. The approach uses concatenated embeddings, a simplified attention mechanism, and a time varying trigger kernel to provide interpretable intra-type influences and flexible intensity dynamics, with training via maximum likelihood. Empirical results on synthetic and public datasets show that ITHP achieves competitive predictive performance while offering rich interpretability through learned interaction kernels and attention maps, enabling insights into user or group influences in social networks.

Abstract

Social networks represent complex ecosystems where the interactions between users or groups play a pivotal role in information dissemination, opinion formation, and social interactions. Effectively harnessing event sequence data within social networks to unearth interactions among users or groups has persistently posed a challenging frontier within the realm of point processes. Current deep point process models face inherent limitations within the context of social networks, constraining both their interpretability and expressive power. These models encounter challenges in capturing interactions among users or groups and often rely on parameterized extrapolation methods when modelling intensity over non-event intervals, limiting their capacity to capture intricate intensity patterns, particularly beyond observed events. To address these challenges, this study proposes modifications to Transformer Hawkes processes (THP), leading to the development of interpretable Transformer Hawkes processes (ITHP). ITHP inherits the strengths of THP while aligning with statistical nonlinear Hawkes processes, thereby enhancing its interpretability and providing valuable insights into interactions between users or groups. Additionally, ITHP enhances the flexibility of the intensity function over non-event intervals, making it better suited to capture complex event propagation patterns in social networks. Experimental results, both on synthetic and real data, demonstrate the effectiveness of ITHP in overcoming the identified limitations. Moreover, they highlight ITHP's applicability in the context of exploring the complex impact of users or groups within social networks.

Interpretable Transformer Hawkes Processes: Unveiling Complex Interactions in Social Networks

TL;DR

This work tackles the interpretability and expressiveness gap in deep point processes for social networks by introducing interpretable Transformer Hawkes Processes (ITHP). ITHP modifies Transformer Hawkes Process to explicitly capture interactions between event types and to model non-event intervals via a fully attention based intensity, aligning with nonlinear Hawkes kernels. The approach uses concatenated embeddings, a simplified attention mechanism, and a time varying trigger kernel to provide interpretable intra-type influences and flexible intensity dynamics, with training via maximum likelihood. Empirical results on synthetic and public datasets show that ITHP achieves competitive predictive performance while offering rich interpretability through learned interaction kernels and attention maps, enabling insights into user or group influences in social networks.

Abstract

Social networks represent complex ecosystems where the interactions between users or groups play a pivotal role in information dissemination, opinion formation, and social interactions. Effectively harnessing event sequence data within social networks to unearth interactions among users or groups has persistently posed a challenging frontier within the realm of point processes. Current deep point process models face inherent limitations within the context of social networks, constraining both their interpretability and expressive power. These models encounter challenges in capturing interactions among users or groups and often rely on parameterized extrapolation methods when modelling intensity over non-event intervals, limiting their capacity to capture intricate intensity patterns, particularly beyond observed events. To address these challenges, this study proposes modifications to Transformer Hawkes processes (THP), leading to the development of interpretable Transformer Hawkes processes (ITHP). ITHP inherits the strengths of THP while aligning with statistical nonlinear Hawkes processes, thereby enhancing its interpretability and providing valuable insights into interactions between users or groups. Additionally, ITHP enhances the flexibility of the intensity function over non-event intervals, making it better suited to capture complex event propagation patterns in social networks. Experimental results, both on synthetic and real data, demonstrate the effectiveness of ITHP in overcoming the identified limitations. Moreover, they highlight ITHP's applicability in the context of exploring the complex impact of users or groups within social networks.
Paper Structure (28 sections, 3 theorems, 19 equations, 24 figures, 4 tables, 1 algorithm)

This paper contains 28 sections, 3 theorems, 19 equations, 24 figures, 4 tables, 1 algorithm.

Key Result

theorem 1

In new_feature_x, the concatenation operation enables us to explicitly capture the desired temporal and event type similarities, while simultaneously avoiding any cross-similarities between timestamps and event types.

Figures (24)

  • Figure 1: Trigger Kernel Recover (Exp)
  • Figure 2: Trigger Kernel Recover (Sin)
  • Figure 3: Intensity Recover (Exp)
  • Figure 4: Intensity Recover (Sin)
  • Figure 6: The attention weight matrix for events and grids in the case of exponential decay kernel. Horizontal axis: source point, Vertical axis: target point. It is evident that events have an impact on the future which decays over time. Grids within non-event intervals do not exert any influence as they are not actual events.
  • ...and 19 more figures

Theorems & Definitions (3)

  • theorem 1
  • theorem 2
  • corollary 1