A Classifier-Based Approach to Multi-Class Anomaly Detection for Astronomical Transients
Rithwik Gupta, Daniel Muthukrishna, Michelle Lochner
TL;DR
This work tackles the challenge of real-time anomaly detection in the time-domain astronomy deluge by repurposing the latent space of a GRU-based light-curve classifier. The authors introduce Multi-Class Isolation Forests (MCIF), which trains an isolation forest for each known class and uses the minimum score across classes to detect anomalies in a 100-dimensional latent space. On simulated ZTF-like data with 12,040 common transients and 54 anomalies, MCIF recovers about 75% of anomalies within the top roughly 15% of ranked candidates, with substantial robustness observed when anomalous calcium-rich transients are excluded. The approach mitigates interpolation needs, leverages inter-passband information, and shows promise for early, real-time identification of rare transients, offering practical utility for next-generation surveys such as LSST.
Abstract
Automating real-time anomaly detection is essential for identifying rare transients, with modern survey telescopes generating tens of thousands of alerts per night, and future telescopes, such as the Vera C. Rubin Observatory, projected to increase this number dramatically. Currently, most anomaly detection algorithms for astronomical transients rely either on hand-crafted features extracted from light curves or on features generated through unsupervised representation learning, coupled with standard anomaly detection algorithms. In this work, we introduce an alternative approach: using the penultimate layer of a neural network classifier as the latent space for anomaly detection. We then propose a novel method, Multi-Class Isolation Forests (\texttt{MCIF}), which trains separate isolation forests for each class to derive an anomaly score for a light curve from its latent space representation. This approach significantly outperforms a standard isolation forest. We also use a simpler input method for real-time transient classifiers which circumvents the need for interpolation and helps the neural network handle irregular sampling and model inter-passband relationships. Our anomaly detection pipeline identifies rare classes including kilonovae, pair-instability supernovae, and intermediate luminosity transients shortly after trigger on simulated Zwicky Transient Facility light curves. Using a sample of our simulations matching the population of anomalies expected in nature (54 anomalies and 12,040 common transients), our method discovered $41\pm3$ anomalies (~75% recall) after following up the top 2000 (~15%) ranked transients. Our novel method shows that classifiers can be effectively repurposed for real-time anomaly detection.
