Lifelong Continual Learning for Anomaly Detection: New Challenges, Perspectives, and Insights
Kamil Faber, Roberto Corizzo, Bartlomiej Sniezynski, Nathalie Japkowicz
TL;DR
The paper defines lifelong anomaly detection as the integration of continual adaptation with knowledge retention for anomaly detectors operating in evolving environments. It formalizes a scenario-generation and evaluation framework, including Lifelong ROC-AUC, backward transfer (BWT), and forward transfer (FWT), to systematically benchmark detectors under lifelong conditions. Experiments across multiple datasets reveal a clear gap between traditional non-lifelong anomaly detectors and lifelong approaches, with replay-based strategies mitigating forgetting and improving transfer, though upper-bound multi-task experts (MSTE) often still outperform them. The work demonstrates the practical value of lifelong learning for anomaly detection in domains such as cybersecurity and cyber-physical systems, and provides open-source tools to spur broader adoption and development.
Abstract
Anomaly detection is of paramount importance in many real-world domains, characterized by evolving behavior. Lifelong learning represents an emerging trend, answering the need for machine learning models that continuously adapt to new challenges in dynamic environments while retaining past knowledge. However, limited efforts are dedicated to building foundations for lifelong anomaly detection, which provides intrinsically different challenges compared to the more widely explored classification setting. In this paper, we face this issue by exploring, motivating, and discussing lifelong anomaly detection, trying to build foundations for its wider adoption. First, we explain why lifelong anomaly detection is relevant, defining challenges and opportunities to design anomaly detection methods that deal with lifelong learning complexities. Second, we characterize learning settings and a scenario generation procedure that enables researchers to experiment with lifelong anomaly detection using existing datasets. Third, we perform experiments with popular anomaly detection methods on proposed lifelong scenarios, emphasizing the gap in performance that could be gained with the adoption of lifelong learning. Overall, we conclude that the adoption of lifelong anomaly detection is important to design more robust models that provide a comprehensive view of the environment, as well as simultaneous adaptation and knowledge retention.
