On the Stability of Neural Networks in Deep Learning
Blaise Delattre
TL;DR
This work presents a unified framework for stabilizing neural networks through sensitivity analysis by combining Lipschitz constraints, curvature-based regularization, and randomized smoothing. It develops fast, deterministic spectral-norm estimation (Gram iteration) for dense and convolutional layers, introduces Lipschitz-by-design layers (including CPL and SLL), and demonstrates improved training stability and robustness. The thesis connects Weierstrass smoothing with randomized smoothing to derive tighter robustness certificates and introduces mechanisms like activation decay and LVM-RS to reduce variance and improve certification. Together, these methods yield practical, scalable tools for certifiable robustness and generalization, with broad implications for vision, NLP, and large-scale models. The work highlights open problems in scaling Lipschitz networks, handling attention in Lipschitz architectures, and extending certified robustness to large language models.
Abstract
Deep learning has achieved remarkable success across a wide range of tasks, but its models often suffer from instability and vulnerability: small changes to the input may drastically affect predictions, while optimization can be hindered by sharp loss landscapes. This thesis addresses these issues through the unifying perspective of sensitivity analysis, which examines how neural networks respond to perturbations at both the input and parameter levels. We study Lipschitz networks as a principled way to constrain sensitivity to input perturbations, thereby improving generalization, adversarial robustness, and training stability. To complement this architectural approach, we introduce regularization techniques based on the curvature of the loss function, promoting smoother optimization landscapes and reducing sensitivity to parameter variations. Randomized smoothing is also explored as a probabilistic method for enhancing robustness at decision boundaries. By combining these perspectives, we develop a unified framework where Lipschitz continuity, randomized smoothing, and curvature regularization interact to address fundamental challenges in stability. The thesis contributes both theoretical analysis and practical methodologies, including efficient spectral norm computation, novel Lipschitz-constrained layers, and improved certification procedures.
