On Learning Latent Models with Multi-Instance Weak Supervision

Kaifu Wang; Efthymia Tsamoura; Dan Roth

On Learning Latent Models with Multi-Instance Weak Supervision

Kaifu Wang, Efthymia Tsamoura, Dan Roth

TL;DR

This work formulates learning Latent models under multi-instance weak supervision (multi-instance PLL), where a deterministic transition $σ$ maps hidden labels from $M$ instances to a weak label $s$. It provides a theoretical foundation by introducing the necessary-and-sufficient $M$-unambiguity condition for learnability under known, and later unknown, transitions, and derives Rademacher-style error bounds using a top-$k$ semantic loss. The paper extends the theory from a single classifier to multiple classifiers and analyzes learning when $σ$ is unknown via an unambiguous transition-space $\mathcal{G}$, with bounded-risk assumptions. Empirical results on neuro-symbolic-style MNIST tasks validate the theory and highlight scalability challenges inherent to weak supervision with complex logical constraints. Overall, the work advances understanding of how multi-instance supervision and logical reasoning can be integrated with learning while offering concrete generalization guarantees and practical insights for scalable neuro-symbolic systems.

Abstract

We consider a weakly supervised learning scenario where the supervision signal is generated by a transition function $σ$ of labels associated with multiple input instances. We formulate this problem as \emph{multi-instance Partial Label Learning (multi-instance PLL)}, which is an extension to the standard PLL problem. Our problem is met in different fields, including latent structural learning and neuro-symbolic integration. Despite the existence of many learning techniques, limited theoretical analysis has been dedicated to this problem. In this paper, we provide the first theoretical study of multi-instance PLL with possibly an unknown transition $σ$. Our main contributions are as follows. Firstly, we propose a necessary and sufficient condition for the learnability of the problem. This condition non-trivially generalizes and relaxes the existing small ambiguity degree in the PLL literature, since we allow the transition to be deterministic. Secondly, we derive Rademacher-style error bounds based on a top-$k$ surrogate loss that is widely used in the neuro-symbolic literature. Furthermore, we conclude with empirical experiments for learning under unknown transitions. The empirical results align with our theoretical findings; however, they also expose the issue of scalability in the weak supervision literature.

On Learning Latent Models with Multi-Instance Weak Supervision

TL;DR

This work formulates learning Latent models under multi-instance weak supervision (multi-instance PLL), where a deterministic transition

maps hidden labels from

instances to a weak label

. It provides a theoretical foundation by introducing the necessary-and-sufficient

-unambiguity condition for learnability under known, and later unknown, transitions, and derives Rademacher-style error bounds using a top-

semantic loss. The paper extends the theory from a single classifier to multiple classifiers and analyzes learning when

is unknown via an unambiguous transition-space

, with bounded-risk assumptions. Empirical results on neuro-symbolic-style MNIST tasks validate the theory and highlight scalability challenges inherent to weak supervision with complex logical constraints. Overall, the work advances understanding of how multi-instance supervision and logical reasoning can be integrated with learning while offering concrete generalization guarantees and practical insights for scalable neuro-symbolic systems.

Abstract

We consider a weakly supervised learning scenario where the supervision signal is generated by a transition function

of labels associated with multiple input instances. We formulate this problem as \emph{multi-instance Partial Label Learning (multi-instance PLL)}, which is an extension to the standard PLL problem. Our problem is met in different fields, including latent structural learning and neuro-symbolic integration. Despite the existence of many learning techniques, limited theoretical analysis has been dedicated to this problem. In this paper, we provide the first theoretical study of multi-instance PLL with possibly an unknown transition

. Our main contributions are as follows. Firstly, we propose a necessary and sufficient condition for the learnability of the problem. This condition non-trivially generalizes and relaxes the existing small ambiguity degree in the PLL literature, since we allow the transition to be deterministic. Secondly, we derive Rademacher-style error bounds based on a top-

surrogate loss that is widely used in the neuro-symbolic literature. Furthermore, we conclude with empirical experiments for learning under unknown transitions. The empirical results align with our theoretical findings; however, they also expose the issue of scalability in the weak supervision literature.

Paper Structure (33 sections, 22 theorems, 121 equations, 1 figure, 3 tables)

This paper contains 33 sections, 22 theorems, 121 equations, 1 figure, 3 tables.

Introduction
Preliminaries
Learning a Single Classifier Under a Known Transition
ERM Learnability
Error Bounds with the Top-$k$ Semantic Loss
Learning Multiple Classifiers Under a Known Transition
ERM Learnability
Learning Under an Unknown Transition
ERM Learnability
Experiments
Related Work
Conclusions
Preliminaries
Loss functions
VC and Natarajan dimensions
...and 18 more sections

Key Result

Lemma 1

If $\sigma$ is $M$-unambigous, then we have: Moreover, if $\sigma$ is not $M$-unambiguous, then learning from partial labels is arbitrarily difficult, in the sense that a classifier $f$ with partial risk ${\mathcal{R}^{01}_\mathsf{P}(f;\sigma)=0}$ can have a risk of ${\mathcal{R}^{01}(f)=1}$.

Figures (1)

Figure 1: Multi-Instance PLL. We aim to learn the $f_i$'s given the $x_i$'s and $s$. $M$ may be different from $n$ and $\sigma$ may be unknown.

Theorems & Definitions (59)

Example 1: SUM2
Definition 1: $M$-unambiguity
Example 2
Lemma 1
Theorem 1: ERM learnability under $M$-unambiguity
Definition 2: 1-unambiguity
Example 3
Proposition 1: ERM learnability under 1- and $M$-unambiguity
Example 4
Definition 3: Top-$k$ partial loss
...and 49 more

On Learning Latent Models with Multi-Instance Weak Supervision

TL;DR

Abstract

On Learning Latent Models with Multi-Instance Weak Supervision

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (59)