Towards Convexity in Anomaly Detection: A New Formulation of SSLM with Unique Optimal Solutions
Hongying Liu, Hao Wang, Haoran Chu, Yibo Wu
TL;DR
This paper addresses nonconvexity in hypersphere-based anomaly detection models such as SSLM and SVDD by proposing a convex kernelized SSLM (CSSLM). For fixed hyperparameters, CSSLM reduces to a convex quadratic or linear program, enabling a global optimum and enabling rigorous analysis of existence, uniqueness, and dual structure, including a novel $\nu$-property that ties hyperparameters to support-vector and margin behavior. The authors establish conditions under which the optimal sphere center is unique and characterize the radius and margin via dual variables, linking CSSLM to traditional SSLM and SVDD as special cases. Empirical results on several datasets show CSSLM consistently achieves strong performance and stable convergence, while nonconvex SSLM often yields local optima and dual SSLM may fail to align with the primal optimum. The work highlights the practical impact of convex reformulations for scalable, reliable anomaly detection and opens avenues for efficient large-scale optimization and further theoretical properties.
Abstract
An unsolved issue in widely used methods such as Support Vector Data Description (SVDD) and Small Sphere and Large Margin SVM (SSLM) for anomaly detection is their nonconvexity, which hampers the analysis of optimal solutions in a manner similar to SVMs and limits their applicability in large-scale scenarios. In this paper, we introduce a novel convex SSLM formulation which has been demonstrated to revert to a convex quadratic programming problem for hyperparameter values of interest. Leveraging the convexity of our method, we derive numerous results that are unattainable with traditional nonconvex approaches. We conduct a thorough analysis of how hyperparameters influence the optimal solution, pointing out scenarios where optimal solutions can be trivially found and identifying instances of ill-posedness. Most notably, we establish connections between our method and traditional approaches, providing a clear determination of when the optimal solution is unique--a task unachievable with traditional nonconvex methods. We also derive the nu-property to elucidate the interactions between hyperparameters and the fractions of support vectors and margin errors in both positive and negative classes.
