A Unified Framework for Human-Allied Learning of Probabilistic Circuits

Athresh Karanam; Saurabh Mathur; Sahil Sidheekh; Sriraam Natarajan

A Unified Framework for Human-Allied Learning of Probabilistic Circuits

Athresh Karanam, Saurabh Mathur, Sahil Sidheekh, Sriraam Natarajan

TL;DR

This work proposes a unified framework to infuse domain knowledge into learning probabilistic circuits (PCs) by encoding various knowledge forms as differentiable domain constraints. Learning becomes a constrained optimization problem where the PC log-likelihood $\mathcal{L}(\mathcal{M}, \mathcal{D})$ is maximized while ensuring constraint satisfaction via a penalty function $\zeta$, with iterative penalty weights $\lambda_t$ updated by a factor $\gamma$ (Algorithm 1). Knowledge is represented as equal or inequality constraints on marginal and conditional queries, encompassing generalization, context-specific independence, monotonic influence, class-imbalance, and privileged information, enabling six instantiations. The framework is validated on synthetic, benchmark, and real-world data using RatSPN and EinsumNet, showing faithful constraint integration and improved generalization over purely data-driven learning, with demonstrated robustness to constraint noise and hyperparameters. Overall, the approach offers a simple, effective pathway for domain-expert guided PC learning in data-scarce, knowledge-rich settings, with broad applicability and potential extensions to relational data and PC structure learning.

Abstract

Probabilistic Circuits (PCs) have emerged as an efficient framework for representing and learning complex probability distributions. Nevertheless, the existing body of research on PCs predominantly concentrates on data-driven parameter learning, often neglecting the potential of knowledge-intensive learning, a particular issue in data-scarce/knowledge-rich domains such as healthcare. To bridge this gap, we propose a novel unified framework that can systematically integrate diverse domain knowledge into the parameter learning process of PCs. Experiments on several benchmarks as well as real world datasets show that our proposed framework can both effectively and efficiently leverage domain knowledge to achieve superior performance compared to purely data-driven learning approaches.

A Unified Framework for Human-Allied Learning of Probabilistic Circuits

TL;DR

is maximized while ensuring constraint satisfaction via a penalty function

, with iterative penalty weights

updated by a factor

(Algorithm 1). Knowledge is represented as equal or inequality constraints on marginal and conditional queries, encompassing generalization, context-specific independence, monotonic influence, class-imbalance, and privileged information, enabling six instantiations. The framework is validated on synthetic, benchmark, and real-world data using RatSPN and EinsumNet, showing faithful constraint integration and improved generalization over purely data-driven learning, with demonstrated robustness to constraint noise and hyperparameters. Overall, the approach offers a simple, effective pathway for domain-expert guided PC learning in data-scarce, knowledge-rich settings, with broad applicability and potential extensions to relational data and PC structure learning.

Abstract

Paper Structure (11 sections, 1 theorem, 7 equations, 1 figure)

This paper contains 11 sections, 1 theorem, 7 equations, 1 figure.

Introduction
Background
Probabilistic Circuits (PCs)
Knowledge-based Learning.
Learning PCs with Domain Knowledge
Encoding knowledge as constraints
Defining the penalty function
Experimental Evaluation
Experimental Setup.
Results
Discussion

Key Result

Theorem 1

If the PC $\mathcal{M}$ is smooth, decomposable, and deterministic, and $\zeta$ is a differentiable, concave function of $\theta,$ then algorithm 1 is guaranteed to converge to the optimal feasible solution of equation eq:opt3, if one exists, as $t_\text{max} \rightarrow{} \infty.$

Figures (1)

Figure 1: 3D Helix dataset: Visualization of the 3D Helix dataset. The train dataset (a) consists of a single helix of length $2\pi$ with Gaussian noise added to it. The test dataset (b) consists of a helix of length $4\pi$ with Gaussian noise added to it. (c) and (d) visualizes 1000 samples generated from EinsumNet with and without incorporating the Generalization Constraint (GC), respectively.

Theorems & Definitions (5)

Definition 1
Definition 2
Definition 3
Theorem 1
proof

A Unified Framework for Human-Allied Learning of Probabilistic Circuits

TL;DR

Abstract

A Unified Framework for Human-Allied Learning of Probabilistic Circuits

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (5)