Reducing false positives in strong lens detection through effective augmentation and ensemble learning
Samira Rezaei, Amirmohammad Chegeni, Bharath Chowdhary Nagam, J. P. McKean, Mitra Baratchi, Koen Kuijken, Léon V. E. Koopmans
TL;DR
The paper tackles false positives in automated strong gravitational lens detection by examining how training-data quality and diversity shape CNN performance. It systematically compares Vanilla, Applied1, and Applied2 data strategies and employs ensembles of DenseNet and EfficientNet architectures to reduce false positives while preserving completeness. A Combined ensemble achieves a false-positive rate of $10^{-4}$ with a modest ~2.3 percent drop in true positives, illustrating a favorable FP-TP trade-off for large surveys like KiDS and Euclid. The work provides actionable guidance on data-design and ensembling to enable scalable, reliable lens candidate catalogs for current and upcoming cosmological surveys.
Abstract
This research studies the impact of high-quality training datasets on the performance of Convolutional Neural Networks (CNNs) in detecting strong gravitational lenses. We stress the importance of data diversity and representativeness, demonstrating how variations in sample populations influence CNN performance. In addition to the quality of training data, our results highlight the effectiveness of various techniques, such as data augmentation and ensemble learning, in reducing false positives while maintaining model completeness at an acceptable level. This enhances the robustness of gravitational lens detection models and advancing capabilities in this field. Our experiments, employing variations of DenseNet and EfficientNet, achieved a best false positive rate (FP rate) of $10^{-4}$, while successfully identifying over 88 per cent of genuine gravitational lenses in the test dataset. This represents an 11-fold reduction in the FP rate compared to the original training dataset. Notably, this substantial enhancement in the FP rate is accompanied by only a 2.3 per cent decrease in the number of true positive samples. Validated on the KiDS dataset, our findings offer insights applicable to ongoing missions, like Euclid.
