Instance Dependent Testing of Samplers using Interval Conditioning
Rishiraj Bhattacharyya, Sourav Chakraborty, Yash Pote, Uddalok Sarkar, Sayantan Sen
TL;DR
The paper tackles the problem of verifying samplers that may output from infinite or discrete domains by introducing instance-dependent testers under interval conditioning. It develops a framework that leverages a convolution with a Triangular distribution to simulate continuous interval conditioning and applies the Tootsie Pop Algorithm to estimate probability masses, yielding toltest and ERtoltest for distance testing. The authors instantiate these ideas in Lachesis, a practical tester for inverse transform samplers, and demonstrate significant empirical speedups (up to 1000x) over prior worst-case approaches. The work advances sampler verification by delivering instance-aware guarantees and scalable testing for both discrete and continuous-like samplers, with broad potential impact on reliability and transparency in probabilistic AI systems.
Abstract
Sampling algorithms play a pivotal role in probabilistic AI. However, verifying if a sampler program indeed samples from the claimed distribution is a notoriously hard problem. Provably correct testers like Barbarik, Teq, Flash, CubeProbe for testing of different kinds of samplers were proposed only in the last few years. All these testers focus on the worst-case efficiency, and do not support verification of samplers over infinite domains, a case occurring frequently in Astronomy, Finance, Network Security, etc. In this work, we design the first tester of samplers with instance-dependent efficiency, allowing us to test samplers over natural numbers. Our tests are developed via a novel distance estimation algorithm between an unknown and a known probability distribution using an interval conditioning framework. The core technical contribution is a new connection with probability mass estimation of a continuous distribution. The practical gains are also substantial: our experiments establish up to 1000x speedup over state-of-the-art testers.
