Learn to Rank Risky Investors: A Case Study of Predicting Retail Traders' Behaviour and Profitability
Weixian Waylon Li, Tiejun Ma
TL;DR
This work reframes risky trader detection in CFD markets as a learning-to-rank problem, introducing PA-RiskRanker that optimizes Profit-Aware BCE loss and employs a Self-Cross-Trader Attention pipeline to capture intra- and inter-trader dynamics. By embedding financial profit signals directly into the ranking objective and modeling complex feature interactions, the approach achieves notable gains in both ranking metrics and monetary impact, surpassing state-of-the-art ranking models and traditional classification baselines. The study also adopts a two-step evaluation framework to bridge ranking signals with interpretable classifiers, demonstrating practical gains in predictive performance and decision transparency. The results suggest substantial practical value for market makers in real-time hedging decisions and highlight promising avenues for extending the framework to related financial risk domains and multimodal data.
Abstract
Identifying risky traders with high profits in financial markets is crucial for market makers, such as trading exchanges, to ensure effective risk management through real-time decisions on regulation compliance and hedging. However, capturing the complex and dynamic behaviours of individual traders poses significant challenges. Traditional classification and anomaly detection methods often establish a fixed risk boundary, failing to account for this complexity and dynamism. To tackle this issue, we propose a profit-aware risk ranker (PA-RiskRanker) that reframes the problem of identifying risky traders as a ranking task using Learning-to-Rank (LETOR) algorithms. Our approach features a Profit-Aware binary cross entropy (PA-BCE) loss function and a transformer-based ranker enhanced with a self-cross-trader attention pipeline. These components effectively integrate profit and loss (P&L) considerations into the training process while capturing intra- and inter-trader relationships. Our research critically examines the limitations of existing deep learning-based LETOR algorithms in trading risk management, which often overlook the importance of P&L in financial scenarios. By prioritising P&L, our method improves risky trader identification, achieving an 8.4% increase in F1 score compared to state-of-the-art (SOTA) ranking models like Rankformer. Additionally, it demonstrates a 10%-17% increase in average profit compared to all benchmark models.
