Quantum Bayesian Networks for Machine Learning in Oil-Spill Detection
Owais Ishtiaq Siddiqui, Nouhaila Innan, Alberto Marchisio, Mohamed Bennai, Muhammad Shafique
TL;DR
The paper tackles oil-spill detection from satellite imagery under imbalanced data and limited quantum hardware by introducing Quantum Bayesian Networks (QBNs) that fuse probabilistic reasoning with quantum state preparation. It demonstrates the standalone strength of QBNs in precision/recall and shows that hybrid QBNs combined with classical ML models can achieve near-optimal AUCs ($\approx 0.99$), albeit with AUC limitations when using QBNs alone due to imbalance. Experimental evaluation includes a 48-feature setup mapped to $N$-qubit circuits, experiments on a 127-qubit IBM device, and Qiskit-based simulations, highlighting both the potential and the hardware-induced constraints of quantum-assisted environmental monitoring. The work provides a practical pathway for quantum-enhanced anomaly detection in ocean surveillance and outlines future optimization needs for robust, scalable deployment.
Abstract
Quantum Machine Learning (QML) has shown promise in diverse applications such as environmental monitoring, healthcare diagnostics, and financial modeling. However, its practical implementation faces challenges, including limited quantum hardware and the complexity of integrating quantum algorithms with classical systems. One critical challenge is handling imbalanced datasets, where rare events are often misclassified due to skewed data distributions. Quantum Bayesian Networks (QBNs) address this issue by enhancing feature extraction and improving the classification of rare events such as oil spills. This paper introduces a Bayesian approach utilizing QBNs to classify satellite-derived imbalanced datasets, distinguishing ``oil-spill'' from ``non-spill'' regions. QBNs leverage probabilistic reasoning and quantum state preparation to integrate quantum enhancements into classical machine learning architectures. Our approach achieves a 0.99 AUC score, demonstrating its efficacy in anomaly detection and advancing precise environmental monitoring and management. While integration enhances classification performance, dataset-specific challenges require further optimization.
