A Complexity Map of Probabilistic Reasoning for Neurosymbolic Classification Techniques
Arthur Ledaguenel, Céline Hudelot, Mostepha Khouadjia
TL;DR
The paper tackles the scalability challenge of neurosymbolic probabilistic reasoning by developing a unified formalism for four core reasoning problems and constructing the first complexity map that ties tractability to succinct representation languages. It systematically analyzes how different knowledge representations—especially hierarchical, cardinal, simple-path, and matching constraints—affect the feasibility of probabilistic queries, optimization, and enumeration, using knowledge compilation into $d$-DNNF and related targets. The authors provide new tractability results (e.g., for Card and ASPath) and demonstrate that common compilation targets like DNNF/d-DNNF do not cover all tractable cases (as with Match), highlighting limitations and guiding practitioners toward appropriate representation choices. Collectively, the work offers actionable guidance for selecting succinct languages and compilation strategies to scale probabilistic neurosymbolic techniques to large-class, real-world tasks.
Abstract
Neurosymbolic artificial intelligence is a growing field of research aiming to combine neural network learning capabilities with the reasoning abilities of symbolic systems. Informed multi-label classification is a sub-field of neurosymbolic AI which studies how to leverage prior knowledge to improve neural classification systems. Recently, a family of neurosymbolic techniques for informed classification based on probabilistic reasoning has gained significant traction. Unfortunately, depending on the language used to represent prior knowledge, solving certain probabilistic reasoning problems can become prohibitively hard when the number of classes increases. Therefore, the asymptotic complexity of probabilistic reasoning is of cardinal importance to assess the scalability of such techniques. In this paper, we develop a unified formalism for four probabilistic reasoning problems. Then, we compile several known and new tractability results into a single complexity map of probabilistic reasoning. We build on top of this complexity map to characterize the domains of scalability of several techniques. We hope this work will help neurosymbolic AI practitioners navigate the scalability landscape of probabilistic neurosymbolic techniques.
