A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference
Or Zamir
TL;DR
The paper demonstrates that simple layerwise delta-consistency checks for floating-point neural inference do not guarantee a correct final output, by constructing a functionally equivalent network F' that admits delta-consistent transcripts steering the final result to any target z with ||z||_infty ≤ R. It provides a concrete trigger-channel construction that widens hidden layers and augments the final layer so that exact inference matches F while allowing adversarial perturbations to accumulate and control the final output under delta-consistency. The key contribution is a rigorous demonstration that layerwise approximate verification is not inherently composable, highlighting the need for stability assumptions or protocol-level reasoning about error propagation in zk-ML. Practically, this warns against assuming naive per-layer checks suffice for verifiable ML inference and urges careful design of threat models and verification protocols that consider adversarially structured networks.
Abstract
A natural and informal approach to verifiable (or zero-knowledge) ML inference over floating-point data is: ``prove that each layer was computed correctly up to tolerance $δ$; therefore the final output is a reasonable inference result''. This short note gives a simple counterexample showing that this inference is false in general: for any neural network, we can construct a functionally equivalent network for which adversarially chosen approximation-magnitude errors in individual layer computations suffice to steer the final output arbitrarily (within a prescribed bounded range).
