Diffusion Posterior Sampling is Computationally Intractable
Shivam Gupta, Ajil Jalal, Aditya Parulekar, Eric Price, Zhiyang Xun
TL;DR
This work proves that posterior sampling for diffusion models under a linear measurement model is computationally intractable in general, assuming the existence of one-way functions. It contrasts efficient unconditional sampling, which diffusion models can achieve for well-modeled distributions, with hardness results for posterior inference, deriving both superpolynomial and exponential-time lower bounds. The authors develop a well-modeled hard instance $\widetilde{g}$ and show that any $(\tfrac{1}{10},\tfrac{1}{10})$-posterior sampler would imply inverting a one-way function, while also showing that the associated smoothed-score can be represented by a polynomial-size ReLU network. They further provide an upper-bound analysis via rejection sampling, illustrating that, under exponential-time inverters, rejection sampling is near-optimal. Overall, the paper delineates the limitations of general-purpose posterior sampling and motivates distributional assumptions or new algorithms tailored to specific task priors.
Abstract
Diffusion models are a remarkably effective way of learning and sampling from a distribution $p(x)$. In posterior sampling, one is also given a measurement model $p(y \mid x)$ and a measurement $y$, and would like to sample from $p(x \mid y)$. Posterior sampling is useful for tasks such as inpainting, super-resolution, and MRI reconstruction, so a number of recent works have given algorithms to heuristically approximate it; but none are known to converge to the correct distribution in polynomial time. In this paper we show that posterior sampling is computationally intractable: under the most basic assumption in cryptography -- that one-way functions exist -- there are instances for which every algorithm takes superpolynomial time, even though unconditional sampling is provably fast. We also show that the exponential-time rejection sampling algorithm is essentially optimal under the stronger plausible assumption that there are one-way functions that take exponential time to invert.
