Topology-Agnostic Detection of Temporal Money Laundering Flows in Billion-Scale Transactions
Haseeb Tariq, Marwan Hassani
TL;DR
This work tackles cross-bank money laundering detection by modeling transaction traces as a topology-agnostic, temporal graph and enriching it with higher-order relationships. The FaSTM∀N framework constructs a scalable temporal graph of sequential transactions, derives a 2nd-order representation to quantify dependencies, and applies weighted graph clustering (Leiden) to reveal significant anomalous communities. It demonstrates superior scalability and detection coverage against FlowScope and DBJ* on a dataset exceeding $1\times 10^9$ transactions, aided by a distributed Spark/GraphFrames implementation and a carefully chosen time window $\Delta w$ to bound search space. The approach enables efficient, multi-hop money-flow analytics across multiple banks, with practical AML implications for large-scale, real-world transaction networks.
Abstract
Money launderers exploit the weaknesses in detection systems by purposefully placing their ill-gotten money into multiple accounts, at different banks. That money is then layered and moved around among mule accounts to obscure the origin and the flow of transactions. Consequently, the money is integrated into the financial system without raising suspicion. Path finding algorithms that aim at tracking suspicious flows of money usually struggle with scale and complexity. Existing community detection techniques also fail to properly capture the time-dependent relationships. This is particularly evident when performing analytics over massive transaction graphs. We propose a framework (called FaSTMAN), adapted for domain-specific constraints, to efficiently construct a temporal graph of sequential transactions. The framework includes a weighting method, using 2nd order graph representation, to quantify the significance of the edges. This method enables us to distribute complex queries on smaller and densely connected networks of flows. Finally, based on those queries, we can effectively identify networks of suspicious flows. We extensively evaluate the scalability and the effectiveness of our framework against two state-of-the-art solutions for detecting suspicious flows of transactions. For a dataset of over 1 Billion transactions from multiple large European banks, the results show a clear superiority of our framework both in efficiency and usefulness.
