Equivalence and Similarity Refutation for Probabilistic Programs
Krishnendu Chatterjee, Ehsan Kafshdar Goharshady, Petr Novotný, Đorđe Žikelić
TL;DR
This work tackles static verification of equivalence and similarity between the output distributions of probabilistic programs. It introduces a novel martingale-based framework that jointly synthesizes a function $f$ and two state functions (a UESM for the first program and a LESM for the second) to certify non-equivalence via $\mathbb{E}_{\mu_1}[f] \neq \mathbb{E}_{\mu_2}[f]$. The approach yields formal guarantees, is automated, and extends to infinite-state programs; for similarity, it can provide lower bounds on the Kantorovich distance by requiring $f$ to be $1$-Lipschitz. An automated constraint-solver pipeline synthesizes $f$, $U_f$, and $L_f$ as polynomial templates and translates the resulting conditions into a linear program, enabling scalable analysis on polynomial probabilistic programs. The results demonstrate effective refutation and distance lower-bounding across a diverse set of benchmarks, highlighting the method’s practical impact for debugging and compiler verification in probabilistic programming.
Abstract
We consider the problems of statically refuting equivalence and similarity of output distributions defined by a pair of probabilistic programs. Equivalence and similarity are two fundamental relational properties of probabilistic programs that are essential for their correctness both in implementation and in compilation. In this work, we present a new method for static equivalence and similarity refutation. Our method refutes equivalence and similarity by computing a function over program outputs whose expected value with respect to the output distributions of two programs is different. The function is computed simultaneously with an upper expectation supermartingale and a lower expectation submartingale for the two programs, which we show to together provide a formal certificate for refuting equivalence and similarity. To the best of our knowledge, our method is the first approach to relational program analysis to offer the combination of the following desirable features: (1) it is fully automated, (2) it is applicable to infinite-state probabilistic programs, and (3) it provides formal guarantees on the correctness of its results. We implement a prototype of our method and our experiments demonstrate the effectiveness of our method to refute equivalence and similarity for a number of examples collected from the literature.
