Zero-Knowledge Proof Based Verifiable Inference of Models
Yunxiao Wang
TL;DR
The paper tackles the problem of verifying AI inference correctness without exposing proprietary parameters. It introduces a zero-knowledge framework built on PLONK/Kimchi with recursive SNARKs and no trusted setup, enabling constant-size proofs that scale with model size. By leveraging CRPC for matrix multiplication and modular components (GeMM, RMSNorm, RoPE, Softmax, SiLU, etc.), it demonstrates ZK-DeepSeek, a SNARK-verifiable version of DeepSeek-V3 with a 671B-parameter scale. The work shows practical verifiability for large LLMs and points to GPU-accelerated implementations to make verifiable AI widely feasible in real-world deployments.
Abstract
Recent advances in artificial intelligence (AI), particularly deep learning, have led to widespread adoption across various applications. Yet, a fundamental challenge persists: how can we verify the correctness of AI model inference when model owners cannot (or will not) reveal their parameters? These parameters represent enormous training costs and valuable intellectual property, making transparent verification difficult. In this paper, we introduce a zero-knowledge framework capable of verifying deep learning inference without exposing model internal parameters. Built on recursively composed zero-knowledge proofs and requiring no trusted setup, our framework supports both linear and nonlinear neural network layers, including matrix multiplication, normalization, softmax, and SiLU. Leveraging the Fiat-Shamir heuristic, we obtain a succinct non-interactive argument of knowledge (zkSNARK) with constant-size proofs. To demonstrate the practicality of our approach, we translate the DeepSeek model into a fully SNARK-verifiable version named ZK-DeepSeek and show experimentally that our framework delivers both efficiency and flexibility in real-world AI verification workloads.
