Dynamic Feature Fusion: Combining Global Graph Structures and Local Semantics for Blockchain Fraud Detection
Zhang Sheng, Liangliang Song, Yanbin Wang
TL;DR
The paper tackles blockchain fraud detection by jointly modeling global graph structures and local semantic information from transaction data. It introduces ETH-GBERT, a dynamic multimodal fusion model that merges a GCN-based graph representation with a BERT-based semantic encoder through a gating mechanism, enabling adaptive fusion across inputs. A comprehensive data processing pipeline constructs the transaction graph, enriches temporal features with n-gram time differences, and preprocesses text data for semantic modeling, leading to state-of-the-art results on large Ethereum datasets. The work demonstrates that adaptive, modality-aware fusion of structural and semantic cues significantly improves detection accuracy, robustness, and scalability for securing blockchain systems.
Abstract
The advent of blockchain technology has facilitated the widespread adoption of smart contracts in the financial sector. However, current fraud detection methodologies exhibit limitations in capturing both global structural patterns within transaction networks and local semantic relationships embedded in transaction data. Most existing models focus on either structural information or semantic features individually, leading to suboptimal performance in detecting complex fraud patterns.In this paper, we propose a dynamic feature fusion model that combines graph-based representation learning and semantic feature extraction for blockchain fraud detection. Specifically, we construct global graph representations to model account relationships and extract local contextual features from transaction data. A dynamic multimodal fusion mechanism is introduced to adaptively integrate these features, enabling the model to capture both structural and semantic fraud patterns effectively. We further develop a comprehensive data processing pipeline, including graph construction, temporal feature enhancement, and text preprocessing. Experimental results on large-scale real-world blockchain datasets demonstrate that our method outperforms existing benchmarks across accuracy, F1 score, and recall metrics. This work highlights the importance of integrating structural relationships and semantic similarities for robust fraud detection and offers a scalable solution for securing blockchain systems.
