Detecting Toxic Flow
Álvaro Cartea, Gerardo Duran-Martin, Leandro Sánchez-Betancourt
TL;DR
This work tackles the problem of predicting toxic FX trades at the granularity of individual trades within broker–client flow.It introduces PULSE, an online Bayesian neural-network training method that decomposes hidden-layer parameters into a fixed subspace and a last layer, enabling rapid updates and uncertainty quantification.Empirically, PULSE outperforms logistic regression, random forests, and a recursively updated MLE in predicting toxic trades and yields higher broker PnL when combined with an internalise/externalise strategy.The results also show that a universal model with client features generally surpasses per-client models and that the approach scales to real-time deployment with meaningful practical impact for toxicity management in FX markets.
Abstract
This paper develops a framework to predict toxic trades that a broker receives from her clients. Toxic trades are predicted with a novel online learning Bayesian method which we call the projection-based unification of last-layer and subspace estimation (PULSE). PULSE is a fast and statistically-efficient Bayesian procedure for online training of neural networks. We employ a proprietary dataset of foreign exchange transactions to test our methodology. Neural networks trained with PULSE outperform standard machine learning and statistical methods when predicting if a trade will be toxic; the benchmark methods are logistic regression, random forests, and a recursively-updated maximum-likelihood estimator. We devise a strategy for the broker who uses toxicity predictions to internalise or to externalise each trade received from her clients. Our methodology can be implemented in real-time because it takes less than one millisecond to update parameters and make a prediction. Compared with the benchmarks, online learning of a neural network with PULSE attains the highest PnL and avoids the most losses by externalising toxic trades.
