Financial Fraud Detection with Entropy Computing
Babak Emami, Wesley Dyk, David Haycraft, Carrie Spear, Lac Nguyen, Nicholas Chancellor
TL;DR
This work proposes CVQBoost, a boosting algorithm that runs on the Dirac-3 Entropy Quantum Computing platform to solve a continuous-weight boosting problem for fraud detection. By formulating boosting as a continuous quadratic optimization with weights $w_i$ and mapping it to a Dirac-3 Hamiltonian via $J_{ij}$ and $C_i$, CVQBoost achieves competitive AUC against XGBoost while delivering substantially faster training on large-scale data. The results demonstrate favorable scalability across dataset size and feature dimensionality, with CVQBoost outperforming conventional approaches on synthetic data up to 70 million samples and showing robust performance under various class-balancing schemes. The study highlights the practical potential of EQC-based acceleration for high-dimensional, imbalanced classification tasks such as fraud detection, and suggests promising avenues for explainability through the analysis of weak classifiers.
Abstract
We introduce CVQBoost, a novel classification algorithm that leverages early hardware implementing Quantum Computing Inc's Entropy Quantum Computing (EQC) paradigm, Dirac-3 [Nguyen et. al. arXiv:2407.04512]. We apply CVQBoost to a fraud detection test case and benchmark its performance against XGBoost, a widely utilized ML method. Running on Dirac-3, CVQBoost demonstrates a significant runtime advantage over XGBoost, which we evaluate on high-performance hardware comprising up to 48 CPUs and four NVIDIA L4 GPUs using the RAPIDS AI framework. Our results show that CVQBoost maintains competitive accuracy (measured by AUC) while significantly reducing training time, particularly as dataset size and feature complexity increase. To assess scalability, we extend our study to large synthetic datasets ranging from 1M to 70M samples, demonstrating that CVQBoost on Dirac-3 is well-suited for large-scale classification tasks. These findings position CVQBoost as a promising alternative to gradient boosting methods, offering superior scalability and efficiency for high-dimensional ML applications such as fraud detection.
