Quantization Robustness of Monotone Operator Equilibrium Networks

James Li; Philip H. W. Leong; Thomas Chaffey

Quantization Robustness of Monotone Operator Equilibrium Networks

James Li, Philip H. W. Leong, Thomas Chaffey

TL;DR

This work analyzes weight quantization as a spectral perturbation of the underlying monotone inclusion, finding a phase transition at the predicted threshold: three- and four-bit post-training quantization diverge, while five-bit and above converge.

Abstract

Monotone operator equilibrium networks are implicit-layer models whose output is the unique equilibrium of a monotone operator, guaranteeing existence, uniqueness, and convergence. When deployed on low-precision hardware, weights are quantized, potentially destroying these guarantees. We analyze weight quantization as a spectral perturbation of the underlying monotone inclusion. Convergence of the quantized solver is guaranteed whenever the spectral-norm weight perturbation is smaller than the monotonicity margin; the displacement between quantized and full-precision equilibria is bounded in terms of the perturbation size and margin; and a condition number characterizing the ratio of the operator norm to the margin links quantization precision to forward error. MNIST experiments confirm a phase transition at the predicted threshold: three- and four-bit post-training quantization diverge, while five-bit and above converge. The backward-pass guarantee enables quantization-aware training, which recovers provable convergence at four bits.

Quantization Robustness of Monotone Operator Equilibrium Networks

TL;DR

Abstract

Paper Structure (11 sections, 9 theorems, 14 equations, 3 figures)

This paper contains 11 sections, 9 theorems, 14 equations, 3 figures.

Introduction
Contributions
Related Work
Preliminaries
Monotone operator equilibrium networks
Quantization in a MonDEQ
Margin Perturbation and Well-Posedness
Equilibrium Displacement
Backward Pass Under Quantization
Numerical Experiments
Conclusions

Key Result

Theorem 1

Define a MonDEQ as in Definition def:mondeq. Then $z^\star \in \operatorname{Fix}(\Phi) \Longleftrightarrow 0 \in F(z^\star) + G(z^\star)$.

Figures (3)

Figure 1: Margin stability certificate. Iterations to convergence (top) and final residual (bottom) vs. normalized perturbation $\left\lVert \Delta W \right\rVert_2/m$; each point is one bit-width (3--32 bits). The dashed line marks the sufficient condition $\left\lVert \Delta W \right\rVert_2/m = 1$. Circles: converged (relative residual ${<}\,10^{-5}$); crosses: did not converge within 2000 iterations.
Figure 2: Displacement bound validation (Theorem \ref{['thm:equilibrium_displacement']}) at 6, 8, 12, and 16 bits. Each point is one test sample ($x$-axis: theoretical bound $(\left\lVert \Delta W \right\rVert_2/m)\left\lVert \widetilde{z}^\star \right\rVert_2$; $y$-axis: empirical displacement $\left\lVert \widetilde{z}^\star - z^\star \right\rVert_2$). Points below the dashed line ($y = x$) satisfy the bound.
Figure 3: QAT vs. PTQ at 4, 6, and 8 bits. Left: test accuracy (%; a red X indicates PTQ non-convergence at 4 bits). Right: $\left\lVert \Delta W \right\rVert_2/m$; the dashed line marks $\left\lVert \Delta W \right\rVert_2/m = 1$.

Theorems & Definitions (19)

Definition 1
Theorem 1
Proposition 1
proof
Definition 2
Theorem 2
proof
Corollary 1
proof
Theorem 3
...and 9 more

Quantization Robustness of Monotone Operator Equilibrium Networks

TL;DR

Abstract

Quantization Robustness of Monotone Operator Equilibrium Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (19)