Consensus based Algorithm for Nonparametric Detection of Star Clusters (CANDiSC)
C O Obasi, J G Fernandez Trincado, M Gomez, D Minniti, J Alonso Garcia, B P L Ferreira, E R Garro, B Dias, R K Saito, B Barbuy, M C Parisi, T Palma, B Tang, M Ortigoza Urdaneta, L D Baravalle, M V Alonso, F Mauro
TL;DR
CANDiSC addresses the challenge of detecting star clusters in the crowded, highly extincted inner Galaxy by combining three nonparametric density estimators—Kernel Density Estimation, DBSCAN, and Nearest-Neighbor Density Estimation—within a consensus framework and applying it to the VVVX survey. The method processes 680 tiles across ≈1100 deg², using a color-magnitude filter and a tessellated spatial grid to identify overdensities that are confirmed only when at least two methods agree. Validation on real VVVX data and synthetic injections shows high astrometric accuracy, a false-positive rate below 5%, and the ability to recover known clusters while revealing about 40 new candidates; the approach is scalable to future surveys and can be enhanced with multi-band priors and Gaia data. Overall, CANDiSC provides a robust, automated, and adaptable tool for mining deep infrared photometric surveys to build a more complete census of Galactic star clusters, especially in regions of high extinction and crowding.
Abstract
Context: The VISTA Variables in the Via Lactea (VVV) and its extension (VVVX) are near-infrared surveys mapping the Galactic bulge and adjacent disk. These data have enabled the discovery of numerous star clusters obscured by high and spatially variable extinction. Most previous searches relied on visual inspection of individual tiles, which is inefficient and biased against faint or low-density systems. Aims: We aim to develop an automated, homogeneous algorithm for systematic cluster detection across different surveys. Here, we apply our method to VVVX data covering low-latitude regions of the Galactic bulge and disk, affected by extinction and crowding. Methods: We introduce the Consensus-based Algorithm for Nonparametric Detection of Star Clusters (CANDiSC), which integrates kernel density estimation, the Density-Based Spatial Clustering of Applications with Noise (DBSCAN), and nearest-neighbour density estimation within a consensus framework. A stellar overdensity is classified as a candidate if identified by at least two of these methods. We apply CANDiSC to 680 tiles in the VVVX PSF photometric catalogue, covering approximately 1100 square degrees. Results: We detect 163 stellar overdensities, of which 118 are known clusters. Cross-matching with recent catalogues yields five additional matches, leaving 40 likely new candidates absent from existing compilations. The estimated false-positive rate is below 5 percent. Conclusions: CANDiSC offers a robust and scalable approach for detecting stellar clusters in deep near-infrared surveys, successfully recovering known systems and revealing new candidates in the obscured and crowded regions of the Galactic plane.
