Pseudorandom Error-Correcting Codes
Miranda Christ, Sam Gunn
TL;DR
This work introduces pseudorandom error-correcting codes (PRCs), a new cryptographic primitive that yields codewords indistinguishable from random to any efficient adversary unless a decoding key is known. By constructing PRCs from LDPC-like codes under standard assumptions such as LPN and planted-XOR (or subexponential-LPN in a weaker regime), the authors achieve robust, undetectable watermarks for language-model outputs and universal steganography that tolerate constant rates of substitutions and deletions. They develop zero-bit public-key PRCs with strong robustness to p-bounded channels and show how to boost to constant-rate, multi-bit PRCs, including deletion-channel variants via a majority-encoding trick. The practical upshot is a framework for quality-preserving watermarking and robust steganography that remains detectable (or publicly attributable) under a wide range of corrupted-channel conditions, enabling secure public attribution and stateless steganography. Overall, PRCs bridge cryptographic hardness assumptions with error-correcting structure to deliver cryptographically secure, robust signaling in AI-generated content with broad implications for watermarking and information hiding in real-world systems.
Abstract
We construct pseudorandom error-correcting codes (or simply pseudorandom codes), which are error-correcting codes with the property that any polynomial number of codewords are pseudorandom to any computationally-bounded adversary. Efficient decoding of corrupted codewords is possible with the help of a decoding key. We build pseudorandom codes that are robust to substitution and deletion errors, where pseudorandomness rests on standard cryptographic assumptions. Specifically, pseudorandomness is based on either $2^{O(\sqrt{n})}$-hardness of LPN, or polynomial hardness of LPN and the planted XOR problem at low density. As our primary application of pseudorandom codes, we present an undetectable watermarking scheme for outputs of language models that is robust to cropping and a constant rate of random substitutions and deletions. The watermark is undetectable in the sense that any number of samples of watermarked text are computationally indistinguishable from text output by the original model. This is the first undetectable watermarking scheme that can tolerate a constant rate of errors. Our second application is to steganography, where a secret message is hidden in innocent-looking content. We present a constant-rate stateless steganography scheme with robustness to a constant rate of substitutions. Ours is the first stateless steganography scheme with provable steganographic security and any robustness to errors.
