Evaluating the Vulnerability of ML-Based Ethereum Phishing Detectors to Single-Feature Adversarial Perturbations
Ahod Alghuried, Ali Alkinoon, Abdulaziz Alghamdi, Soohyeon Choi, Manar Mohaisen, David Mohaisen
TL;DR
The paper examines how vulnerable ML-based Ethereum phishing detectors are to simple adversarial perturbations, focusing on single-feature manipulations and using FGSM to probe model robustness. It compares the performance of RF, DT, and KNN under targeted and untargeted attacks and reports significant declines in phishing and scamming detection, with DT and KNN being especially vulnerable. The authors propose adversarial training as a practical defense and show that retraining with adversarial examples restores much of the lost accuracy, particularly when temporal and address features are combined with financial signals. The work offers actionable guidance for hardening Ethereum transaction classifiers against adversarial threats and highlights the importance of robust feature design and defensive training for real-world deployment.
Abstract
This paper explores the vulnerability of machine learning models to simple single-feature adversarial attacks in the context of Ethereum fraudulent transaction detection. Through comprehensive experimentation, we investigate the impact of various adversarial attack strategies on model performance metrics. Our findings are alarming: they show how susceptible these models are to simple attacks. At the same time, the attacks affect different algorithms inconsistently, which suggests avenues for mitigation. We evaluate several mitigation strategies, including adversarial training and enhanced feature selection, and show that they improve model robustness.
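As an illustration of what a single-feature FGSM perturbation looks like, the following is a minimal sketch and not the authors' implementation. It assumes a differentiable logistic-regression surrogate, since the summary above does not say how gradients are obtained for RF, DT, or KNN. The weights, the feature vector, and the names `single_feature_fgsm`, `feature_idx`, and `eps` are all hypothetical.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def single_feature_fgsm(x, y, w, b, feature_idx, eps):
    """One untargeted FGSM step restricted to a single feature.

    Assumption: the model is a logistic-regression surrogate with
    weights w and bias b, trained with binary cross-entropy loss.
    """
    p = sigmoid(x @ w + b)
    # Gradient of the binary cross-entropy loss with respect to the input x.
    grad = (p - y) * w
    x_adv = x.copy()
    # Move only the chosen feature in the direction that increases the loss.
    x_adv[feature_idx] += eps * np.sign(grad[feature_idx])
    return x_adv


# Hypothetical toy values, not taken from the paper.
w = np.array([1.5, -2.0, 0.5])   # surrogate weights
b = -0.25                        # surrogate bias
x = np.array([0.8, 0.1, 0.3])    # one transaction's (scaled) features
y = 1.0                          # true label: phishing

x_adv = single_feature_fgsm(x, y, w, b, feature_idx=0, eps=0.5)
score_before = sigmoid(x @ w + b)
score_after = sigmoid(x_adv @ w + b)
print(x_adv, score_before, score_after)
```

In this toy setting only feature 0 changes, and the surrogate's phishing score for the sample drops. Adversarial training, as described above, would add such perturbed samples with their true labels back into the training set before refitting.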
