Lyapunov-Stable Deep Equilibrium Models
Haoyu Chu, Shikui Wei, Ting Liu, Yao Zhao, Yuto Miyatake
TL;DR
This work introduces LyaDEQ, a Lyapunov-stable Deep Equilibrium Model, to provide provable stability of fixed points defined by $F(\boldsymbol{z}^{\star})=0$. By treating DEQ dynamics as a nonlinear system and learning a convex Lyapunov function $V$ via an ICNN, the method enforces stability conditions that defend against adversarial perturbations; an orthogonal FC layer then separates Lyapunov-stable equilibria to improve class discrimination. Empirical results on MNIST, SVHN, and CIFAR-10/100 show improved robustness against white-box attacks, with further gains when combined with adversarial training methods like TRADES, RD, and PAT. The approach yields competitive clean accuracy and demonstrates substantial robustness benefits, especially at larger attack radii, while remaining compatible with existing defense strategies. Overall, the paper advances a principled, certified-stability pathway for implicit models to resist adversarial threats in practical image classification tasks.
Abstract
Deep equilibrium (DEQ) models have emerged as a promising class of implicit layer models, which abandon traditional depth by solving for the fixed points of a single nonlinear layer. Despite their success, the stability of the fixed points for these models remains poorly understood. By considering DEQ models as nonlinear dynamic systems, we propose a robust DEQ model named LyaDEQ with guaranteed provable stability via Lyapunov theory. The crux of our method is ensuring the Lyapunov stability of the DEQ model's fixed points, which enables the proposed model to resist minor initial perturbations. To avoid poor adversarial defense due to Lyapunov-stable fixed points being located near each other, we orthogonalize the layers after the Lyapunov stability module to separate different fixed points. We evaluate LyaDEQ models under well-known adversarial attacks, and experimental results demonstrate significant improvement in robustness. Furthermore, we show that the LyaDEQ model can be combined with other defense methods, such as adversarial training, to achieve even better adversarial robustness.
