Loss Landscape Analysis for Reliable Quantized ML Models for Scientific Sensing
Tommaso Baldi, Javier Campos, Olivia Weng, Caleb Geniesse, Nhan Tran, Ryan Kastner, Alessandro Biondi
TL;DR
The paper tackles reliability of quantized ML models in scientific sensing by introducing a loss-landscape analysis framework that integrates visualization, CKA similarity, Hessian metrics, and mode connectivity. This approach enables a priori assessment of robustness to input and weight perturbations without exhaustive retraining, highlighting how quantization and regularization shape the loss surface. Key findings show that gently-shaped, flat minima correlate with robustness, that regularizers can improve or sometimes degrade robustness depending on the model, and that increasing precision does not universally enhance robustness. The work provides actionable guidance for including robustness in Pareto optimization, supporting more reliable and adaptive scientific sensing at the edge with efficient design workflows.
Abstract
In this paper, we propose a method to perform empirical analysis of the loss landscape of machine learning (ML) models. The method is applied to two ML models for scientific sensing, which necessitates quantization to be deployed and are subject to noise and perturbations due to experimental conditions. Our method allows assessing the robustness of ML models to such effects as a function of quantization precision and under different regularization techniques -- two crucial concerns that remained underexplored so far. By investigating the interplay between performance, efficiency, and robustness by means of loss landscape analysis, we both established a strong correlation between gently-shaped landscapes and robustness to input and weight perturbations and observed other intriguing and non-obvious phenomena. Our method allows a systematic exploration of such trade-offs a priori, i.e., without training and testing multiple models, leading to more efficient development workflows. This work also highlights the importance of incorporating robustness into the Pareto optimization of ML models, enabling more reliable and adaptive scientific sensing systems.
