Table of Contents
Fetching ...

EXPLAIN, AGREE, LEARN: Scaling Learning for Neural Probabilistic Logic

Victor Verreet, Lennert De Smet, Luc De Raedt, Emanuele Sansone

TL;DR

This work tackles the challenge of scalable learning in neural probabilistic logic (NeSy) by introducing a sampling-based surrogate objective $L = \sum_i \mathrm{KL}(Q_i \| P_i)$ that bounds the true data likelihood and avoids expensive exact inference. The EXPLAIN, AGREE, LEARN (EXAL) framework constructs this surrogate by sampling explanations, reweighting them with the neural posterior, and performing standard gradient updates on the neural component. The authors prove optimality and bounding relationships for the surrogate, show that diversity among explanations tightens the bound, and reformulate sampling as a Markov decision process optimized via GFlowNets. Empirically, EXAL demonstrates faster convergence and competitive accuracy on MNIST addition and Warcraft pathfinding compared with state-of-the-art NeSy methods, while providing theoretical guarantees on approximation error. This approach enables scalable NeSy learning with probabilistic logic by leveraging diverse, sampled explanations to guide neural updates.

Abstract

Neural probabilistic logic systems follow the neuro-symbolic (NeSy) paradigm by combining the perceptive and learning capabilities of neural networks with the robustness of probabilistic logic. Learning corresponds to likelihood optimization of the neural networks. However, to obtain the likelihood exactly, expensive probabilistic logic inference is required. To scale learning to more complex systems, we therefore propose to instead optimize a sampling based objective. We prove that the objective has a bounded error with respect to the likelihood, which vanishes when increasing the sample count. Furthermore, the error vanishes faster by exploiting a new concept of sample diversity. We then develop the EXPLAIN, AGREE, LEARN (EXAL) method that uses this objective. EXPLAIN samples explanations for the data. AGREE reweighs each explanation in concordance with the neural component. LEARN uses the reweighed explanations as a signal for learning. In contrast to previous NeSy methods, EXAL can scale to larger problem sizes while retaining theoretical guarantees on the error. Experimentally, our theoretical claims are verified and EXAL outperforms recent NeSy methods when scaling up the MNIST addition and Warcraft pathfinding problems.

EXPLAIN, AGREE, LEARN: Scaling Learning for Neural Probabilistic Logic

TL;DR

This work tackles the challenge of scalable learning in neural probabilistic logic (NeSy) by introducing a sampling-based surrogate objective that bounds the true data likelihood and avoids expensive exact inference. The EXPLAIN, AGREE, LEARN (EXAL) framework constructs this surrogate by sampling explanations, reweighting them with the neural posterior, and performing standard gradient updates on the neural component. The authors prove optimality and bounding relationships for the surrogate, show that diversity among explanations tightens the bound, and reformulate sampling as a Markov decision process optimized via GFlowNets. Empirically, EXAL demonstrates faster convergence and competitive accuracy on MNIST addition and Warcraft pathfinding compared with state-of-the-art NeSy methods, while providing theoretical guarantees on approximation error. This approach enables scalable NeSy learning with probabilistic logic by leveraging diverse, sampled explanations to guide neural updates.

Abstract

Neural probabilistic logic systems follow the neuro-symbolic (NeSy) paradigm by combining the perceptive and learning capabilities of neural networks with the robustness of probabilistic logic. Learning corresponds to likelihood optimization of the neural networks. However, to obtain the likelihood exactly, expensive probabilistic logic inference is required. To scale learning to more complex systems, we therefore propose to instead optimize a sampling based objective. We prove that the objective has a bounded error with respect to the likelihood, which vanishes when increasing the sample count. Furthermore, the error vanishes faster by exploiting a new concept of sample diversity. We then develop the EXPLAIN, AGREE, LEARN (EXAL) method that uses this objective. EXPLAIN samples explanations for the data. AGREE reweighs each explanation in concordance with the neural component. LEARN uses the reweighed explanations as a signal for learning. In contrast to previous NeSy methods, EXAL can scale to larger problem sizes while retaining theoretical guarantees on the error. Experimentally, our theoretical claims are verified and EXAL outperforms recent NeSy methods when scaling up the MNIST addition and Warcraft pathfinding problems.
Paper Structure (21 sections, 4 theorems, 10 equations, 4 figures, 2 tables, 2 algorithms)

This paper contains 21 sections, 4 theorems, 10 equations, 4 figures, 2 tables, 2 algorithms.

Key Result

Theorem 1

Let $\Psi \subseteq \Phi$ be the explanations sampled by EXPLAIN and define $Q^\ast(f) = P(f \mid x) / \sum_{f' \in \Psi} P(f' \mid x)$, then: The first result tells us that we can solve the minimization of $L$ with respect to $Q$ analytically, avoiding iterative optimization methods. The value reached by the global minimizer is the estimator of the true log-likelihood that we use during the LEAR

Figures (4)

  • Figure 1: The neural component takes the input $x$ (left) to output probabilities over $n = 4$ variables. At the top, NeSy-WMC calculates the probability that $\phi$ is satisfied. Below, EXAL finds and reweighs explanations for $\phi$ to construct $L$.
  • Figure 2: Probabilistic graphical model for NeSy-WMC. The full arrows follow the flow of information during inference. The dashed arrows follow the execution of EXAL, where explanations $f$ are sampled given $\phi$ and reweighed based on the neural output given $x$.
  • Figure 3: Diversity as a function of EXPLAIN calls for different formulas and sampling strategies. The factor is $\exp(-\theta) \in \{ 0.1, 0.4, 0.7, 1 \}$.
  • Figure 4: Convergence of objectives for increasing sample count.

Theorems & Definitions (17)

  • Definition 1
  • Definition 2
  • Definition 3
  • Example 1
  • Definition 4
  • Definition 5
  • Example 2
  • Theorem 1
  • proof
  • Example 3
  • ...and 7 more