PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs

José I. Orlicki

PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs

José I. Orlicki

TL;DR

PoGO tackles the challenge of turning expensive ML training into verifiable blockchain work by requiring miners to demonstrate genuine progress on a loss function $\mathcal{L}(\theta)$. It combines 4-bit quantized gradients with a 32-bit Merkle-tree commitment and a two-phase randomness-based random leaf verification to enable scalable, cryptographic proofs of training progress. An empirical cost analysis shows verification is orders of magnitude cheaper than training for enormous models such as GPT-3 and Gemma-3, justifying the feasibility of a wide-area, incentive-compatible PoUW protocol, albeit with longer block times to accommodate meaningful learning. The framework also addresses data availability, attestation-based finalization, incentive structures, and potential extensions like fine-tuning and zero-knowledge enhancements, highlighting a path toward a robust decentralized ML training ecosystem.

Abstract

We present a design called Proof of Gradient Optimization (PoGO) for blockchain consensus, where miners produce verifiable evidence of training large-scale machine-learning models. Building on previous work, we incorporate quantized gradients (4-bit precision) to reduce storage and computation requirements, while still preserving the ability of verifiers to check that real progress has been made on lowering the model's loss. Additionally, we employ Merkle proofs over the full 32-bit model to handle large parameter sets and to enable random leaf checks with minimal on-chain data. We illustrate these ideas using GPT-3 (175B parameters) as a reference example and also refer to smaller but high-performance models (e.g., Gemma~3 with 27B parameters). We provide an empirical cost analysis showing that verification is significantly cheaper than training, thanks in part to quantization and sampling. We also discuss the necessity of longer block times (potentially hours) when incorporating meaningful training steps, the trade-offs when using specialized GPU hardware, and how binary diffs may incrementally optimize updates. Finally, we note that fine-tuning can be handled in a similar manner, merely changing the dataset and the manner of sampling but preserving the overall verification flow. Our protocol allows verifiers to issue either positive or negative attestations; these are aggregated at finalization to either confirm the update or slash the miner.

PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs

TL;DR

Abstract

PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (1)