Logical Guidance for the Exact Composition of Diffusion Models
Francesco Alesiani, Jonathan Warrell, Tanja Bien, Henrik Christiansen, Matheus Ferraz, Mathias Niepert
TL;DR
LoGDiff addresses the lack of principled support for complex Boolean guidance in diffusion models by deriving an exact Boolean calculus that composes atomic guidance signals under circuit-structured constraints. It replaces fixed weights with probability-dependent coefficients, enabling exact posterior-based guidance for conjunctions, disjunctions, and negations through recursive rules; the framework also provides completeness results for common distribution classes via compilability into probabilistic circuits. The approach is instantiated as a hybrid strategy that combines classifier-free scores with posterior estimates, and is demonstrated on both image generation tasks and protein-ligand design, including repulsive guidance to reduce class confusion. Overall, LoGDiff enables expressive, inference-time constraint satisfaction with diffusion models, with practical impact for controllable generation in vision and drug design.
Abstract
We propose LOGDIFF (Logical Guidance for the Exact Composition of Diffusion Models), a guidance framework for diffusion models that enables principled constrained generation with complex logical expressions at inference time. We study when exact score-based guidance for complex logical formulas can be obtained from guidance signals associated with atomic properties. First, we derive an exact Boolean calculus that provides a sufficient condition for exact logical guidance. Specifically, if a formula admits a circuit representation in which conjunctions combine conditionally independent subformulas and disjunctions combine subformulas that are either conditionally independent or mutually exclusive, exact logical guidance is achievable. In this case, the guidance signal can be computed exactly from atomic scores and posterior probabilities using an efficient recursive algorithm. Moreover, we show that, for commonly encountered classes of distributions, any desired Boolean formula is compilable into such a circuit representation. Second, by combining atomic guidance scores with posterior probability estimates, we introduce a hybrid guidance approach that bridges classifierguidance and classifier-free guidance, applicable to both compositional logical guidance and standard conditional generation. We demonstrate the effectiveness of our framework on multiple image and protein structure generation tasks.
