Event2Vec: A Geometric Approach to Learning Composable Representations of Event Sequences
Antonin Sulc
TL;DR
Event2Vec introduces a geometry-guided framework for learning composable representations of discrete event sequences. By enforcing an additive structure via a reconstruction loss and grounding semantics through a prediction objective, the Euclidean variant provably approaches an ideal additive form, while a hyperbolic variant using Möbius addition models hierarchical, tree-like data. Empirically, the approach yields interpretable trajectories and improves syntactic structure capture over Word2Vec on the Brown Corpus, with qualitative Life Path analyses highlighting analogical reasoning capabilities. The work demonstrates how simple geometric priors can yield mechanistically interpretable embeddings and motivates hybrid models that balance additive interpretability with richer dynamics.
Abstract
The study of neural representations, both in biological and artificial systems, is increasingly revealing the importance of geometric and topological structures. Inspired by this, we introduce Event2Vec, a novel framework for learning representations of discrete event sequences. Our model leverages a simple, additive recurrent structure to learn composable, interpretable embeddings. We provide a theoretical analysis demonstrating that, under specific training objectives, our model's learned representations in a Euclidean space converge to an ideal additive structure. This ensures that the representation of a sequence is the vector sum of its constituent events, a property we term the linear additive hypothesis. To address the limitations of Euclidean geometry for hierarchical data, we also introduce a variant of our model in hyperbolic space, which is naturally suited to embedding tree-like structures with low distortion. We present experiments to validate our hypothesis. Quantitative evaluation on the Brown Corpus yields a Silhouette score of 0.0564, outperforming a Word2Vec baseline (0.0215), demonstrating the model's ability to capture structural dependencies without supervision.
