Do stable neural networks exist for classification problems? -- A new view on stability in AI

Z. N. D. Liu; A. C. Hansen

Do stable neural networks exist for classification problems? -- A new view on stability in AI

Z. N. D. Liu, A. C. Hansen

TL;DR

The paper introduces class stability, a measure $\mathcal{S}^p_{\mathcal{M}}(\overline{f})$ based on the distance to the decision boundary $h^p_{\bar{f}}$, to study stability for discontinuous classification functions. It proves two main results: (i) an Interpolation Theorem showing NNs can interpolate a classification on the $\epsilon$-stable set $\mathcal{M}_{\epsilon}$ while preserving near-original stability, and (ii) a Universal Stability Approximation Theorem ensuring the existence of NN estimators that approximate the target function with stability close to that of the target and with a controllably small mislabelled region. The framework also develops a measure-theoretic view of stability, discusses computability caveats (GHA), and provides explicit stability calculations for canonical sets like cubes and Euclidean balls. Together, these results establish that stable NN approximations exist for classification tasks on compact domains, offering a rigorous lens beyond Lipschitz constants to analyze robustness in AI. The work has potential implications for designing robust classifiers and for theoretical analyses of adversarial stability in discontinuous tasks.

Abstract

In deep learning (DL) the instability phenomenon is widespread and well documented, most commonly using the classical measure of stability, the Lipschitz constant. While a small Lipchitz constant is traditionally viewed as guarantying stability, it does not capture the instability phenomenon in DL for classification well. The reason is that a classification function -- which is the target function to be approximated -- is necessarily discontinuous, thus having an 'infinite' Lipchitz constant. As a result, the classical approach will deem every classification function unstable, yet basic classification functions a la 'is there a cat in the image?' will typically be locally very 'flat' -- and thus locally stable -- except at the decision boundary. The lack of an appropriate measure of stability hinders a rigorous theory for stability in DL, and consequently, there are no proper approximation theoretic results that can guarantee the existence of stable networks for classification functions. In this paper we introduce a novel stability measure $\mathscr{S}(f)$, for any classification function $f$, appropriate to study the stability of discontinuous functions and their approximations. We further prove two approximation theorems: First, for any $ε> 0$ and any classification function $f$ on a \emph{compact set}, there is a neural network (NN) $ψ$, such that $ψ- f \neq 0$ only on a set of measure $< ε$, moreover, $\mathscr{S}(ψ) \geq \mathscr{S}(f) - ε$ (as accurate and stable as $f$ up to $ε$). Second, for any classification function $f$ and $ε> 0$, there exists a NN $ψ$ such that $ψ= f$ on the set of points that are at least $ε$ away from the decision boundary.

Do stable neural networks exist for classification problems? -- A new view on stability in AI

TL;DR

The paper introduces class stability, a measure

based on the distance to the decision boundary

, to study stability for discontinuous classification functions. It proves two main results: (i) an Interpolation Theorem showing NNs can interpolate a classification on the

-stable set

while preserving near-original stability, and (ii) a Universal Stability Approximation Theorem ensuring the existence of NN estimators that approximate the target function with stability close to that of the target and with a controllably small mislabelled region. The framework also develops a measure-theoretic view of stability, discusses computability caveats (GHA), and provides explicit stability calculations for canonical sets like cubes and Euclidean balls. Together, these results establish that stable NN approximations exist for classification tasks on compact domains, offering a rigorous lens beyond Lipschitz constants to analyze robustness in AI. The work has potential implications for designing robust classifiers and for theoretical analyses of adversarial stability in discontinuous tasks.

Abstract

, for any classification function

, appropriate to study the stability of discontinuous functions and their approximations. We further prove two approximation theorems: First, for any

and any classification function

on a \emph{compact set}, there is a neural network (NN)

, such that

only on a set of measure

, moreover,

(as accurate and stable as

up to

). Second, for any classification function

and

, there exists a NN

such that

on the set of points that are at least

away from the decision boundary.

Paper Structure (14 sections, 9 theorems, 76 equations, 2 figures)

This paper contains 14 sections, 9 theorems, 76 equations, 2 figures.

Introduction
Main result
Computability and GHA vs existence of NNs -- Can the brittleness of AI be resolved?
Related work
Lipschitz constant and certificates
Classification functions are inherently 'unstable'
Examples of different stability boundaries
Definitions
Alternative measure for 'robustness'
Properties of the class stability
Class stability of specific sets
Proof of \ref{['interpolation_thm']}
Stability revised
Proof of \ref{['exist_stable']}

Key Result

Theorem 2.1

Let $\mathcal{M}, \mathcal{K} \subset \mathbb{R}^d$, where $\mathcal{K}$ is compact, and $f:\mathcal{M} \rightarrow \mathcal{Y} \subset \mathbb{Z}^+$ be a non-constant classification function where $\mathcal{Y}$ is finite. Recall the extension $\overline{f}: \mathbb{R}^d \rightarrow \overline{\mathc as the $\epsilon$-stable set of $\overline{f}$, where $h^p_{\bar{f}}$ is defined in def:dist. Then,

Figures (2)

Figure 1: Different classes of unstable classification functions.
Figure 2: Step functions with differently placed steps.

Theorems & Definitions (35)

Theorem 2.1: Interpolation theorem for stable sets
Remark 2.2: Deep and Shallow neural networks
Remark 2.3: Interpretation of \ref{['interpolation_thm']}
Theorem 2.4: Universal stability approximation theorem for classification functions
Remark 2.5: Interpretation of \ref{['exist_stable']}
Remark 2.6: Non-compact domains and dependency on the inputs
Proposition 3.1: Unbounded Lipschitz constant for classification functions
Example 3.2
Example 3.3
Example 4.1
...and 25 more

Do stable neural networks exist for classification problems? -- A new view on stability in AI

TL;DR

Abstract

Do stable neural networks exist for classification problems? -- A new view on stability in AI

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (35)