Pruning Boolean d-DNNF Circuits Through Tseitin-Awareness
Vincent Derkinderen
TL;DR
This work addresses the size blow-up that can occur when converting circuits to CNF via Tseitin transformations, which is common in d-DNNF compilation for tractable weighted model counting. It introduces Tseitin artifacts—tautological subcircuits that emerge after existential quantification of Tseitin variables—and provides a linear-time bottom-up method to detect and prune them, in addition to existentially quantifying Tseitin variables. Empirical results across MCC, CNFT, and neuro-symbolic datasets show large average reductions in circuit size: about 77.5% overall when removing both Tseitin variables and artifacts, with artifact removal contributing an extra 22.2–30.6% depending on the dataset. The findings suggest substantial practical benefits for downstream probabilistic inference tasks and motivate integrating artifact-aware pruning into compilation pipelines to improve efficiency.
Abstract
Boolean circuits in d-DNNF form enable tractable probabilistic inference. However, as a key insight of this work, we show that commonly used d-DNNF compilation approaches introduce irrelevant subcircuits. We call these subcircuits Tseitin artifacts, as they are introduced due to the Tseitin transformation step -- a well-established procedure to transform any circuit into the CNF format required by several d-DNNF knowledge compilers. We discuss how to detect and remove both Tseitin variables and Tseitin artifacts, leading to more succinct circuits. We empirically observe an average size reduction of 77.5% when removing both Tseitin variables and artifacts. The additional pruning of Tseitin artifacts reduces the size by 22.2% on average. This significantly improves downstream tasks that benefit from a more succinct circuit, e.g., probabilistic inference tasks.
