Temporal fingerprints: Identity matching across fully encrypted domain
Shahar Somin, Keeley Erhardt, Alex 'Sandy' Pentland
TL;DR
The paper addresses cross-domain identity matching under privacy-preserving constraints by relying on individual temporal activity patterns. It combines an unsupervised affinity based on inter-event time distributions with a Temporal Graph Neural Network trained on daily KS-based similarity graphs to identify profile pairs across encrypted domains. On Ethereum data, it reports average AUC of $0.78$ and precision of $0.96$ for the top-100 matches, outperforming activity-overlap and REGAL baselines and showing robustness to noise. The work demonstrates that timing information can act as a persistent fingerprint across domains, highlighting privacy risks and informing defenses in privacy-preserving system design.
Abstract
Technological advancements have significantly transformed communication patterns, introducing a diverse array of online platforms, thereby prompting individuals to use multiple profiles for different domains and objectives. Enhancing the understanding of cross domain identity matching capabilities is essential, not only for practical applications such as commercial strategies and cybersecurity measures, but also for theoretical insights into the privacy implications of data disclosure. In this study, we demonstrate that individual temporal data, in the form of inter-event times distribution, constitutes an individual temporal fingerprint, allowing for matching profiles across different domains back to their associated real-world entity. We evaluate our methodology on encrypted digital trading platforms within the Ethereum Blockchain and present impressing results in matching identities across these privacy-preserving domains, while outperforming previously suggested models. Our findings indicate that simply knowing when an individual is active, even if information about who they talk to and what they discuss is lacking, poses risks to users' privacy, highlighting the inherent challenges in preserving privacy in today's digital landscape.
