Neural Approaches to SAT Solving: Design Choices and Interpretability
David Mojžíšek, Jan Hůla, Ziwei Li, Ziyu Zhou, Mikoláš Janota
TL;DR
This work examines neural SAT solving with graph neural networks, focusing on how design choices affect generalization to unseen instances. It shows that a variable-clause graph (VCG) with recurrent updates, combined with a closest assignment training objective, achieves strong SAT assignment accuracy and scalable inference via extra iterations or resampling. The authors extend the base model with a diffusion-based assignment generator, connecting the approach to continuous relaxation views of MaxSAT and enabling effective test-time adaptation, especially when interleaved with unit propagation. Analyses of embedding spaces and optimization trajectories reveal that the GNNimplicitly performs a continuous relaxation-like optimization, offering an interpretable lens on neural reasoning for combinatorial tasks and guiding future hybrid solver designs.
Abstract
In this contribution, we provide a comprehensive evaluation of graph neural networks applied to Boolean satisfiability problems, accompanied by an intuitive explanation of the mechanisms enabling the model to generalize to different instances. We introduce several training improvements, particularly a novel closest assignment supervision method that dynamically adapts to the model's current state, significantly enhancing performance on problems with larger solution spaces. Our experiments demonstrate the suitability of variable-clause graph representations with recurrent neural network updates, which achieve good accuracy on SAT assignment prediction while reducing computational demands. We extend the base graph neural network into a diffusion model that facilitates incremental sampling and can be effectively combined with classical techniques like unit propagation. Through analysis of embedding space patterns and optimization trajectories, we show how these networks implicitly perform a process very similar to continuous relaxations of MaxSAT, offering an interpretable view of their reasoning process. This understanding guides our design choices and explains the ability of recurrent architectures to scale effectively at inference time beyond their training distribution, which we demonstrate with test-time scaling experiments.
