Online Learning with Set-Valued Feedback

Vinod Raman; Unique Subedi; Ambuj Tewari

Online Learning with Set-Valued Feedback

Vinod Raman, Unique Subedi, Ambuj Tewari

TL;DR

The paper introduces online learning with set-valued feedback, where the learner predicts a single label but receives a set of correct labels $S_t$ and pays a loss when $\hat{y}_t \notin S_t$. It defines two key dimensions, the Set Littlestone dimension $\text{SL}(\mathcal{H})$ for deterministic realizable learnability and the Measure Shattering dimension $\text{MS}_{\gamma}(\mathcal{H})$ for randomized realizable and agnostic learnability, with a central role for the Helly number $H(\mathcal{S}(\mathcal{Y}))$ in relating these dimensions. The results show a separation between deterministic and randomized learnability in the realizable setting (unless $H(\mathcal{S}(\mathcal{Y}))<\infty$), provide algorithmic realizations (deterministic SOA and randomized chaining-based methods), and extend to agnostic learnability with minimax regret bounds. Applications to online multilabel ranking, online multilabel classification, and interval-valued prediction illustrate the practical impact of the theory by yielding sharp, dimension-dependent minimax guarantees.

Abstract

We study a variant of online multiclass classification where the learner predicts a single label but receives a \textit{set of labels} as feedback. In this model, the learner is penalized for not outputting a label contained in the revealed set. We show that unlike online multiclass learning with single-label feedback, deterministic and randomized online learnability are \textit{not equivalent} even in the realizable setting with set-valued feedback. Accordingly, we give two new combinatorial dimensions, named the Set Littlestone and Measure Shattering dimension, that tightly characterize deterministic and randomized online learnability respectively in the realizable setting. In addition, we show that the Measure Shattering dimension characterizes online learnability in the agnostic setting and tightly quantifies the minimax regret. Finally, we use our results to establish bounds on the minimax regret for three practical learning settings: online multilabel ranking, online multilabel classification, and real-valued prediction with interval-valued response.

Online Learning with Set-Valued Feedback

TL;DR

The paper introduces online learning with set-valued feedback, where the learner predicts a single label but receives a set of correct labels

and pays a loss when

. It defines two key dimensions, the Set Littlestone dimension

for deterministic realizable learnability and the Measure Shattering dimension

for randomized realizable and agnostic learnability, with a central role for the Helly number

in relating these dimensions. The results show a separation between deterministic and randomized learnability in the realizable setting (unless

), provide algorithmic realizations (deterministic SOA and randomized chaining-based methods), and extend to agnostic learnability with minimax regret bounds. Applications to online multilabel ranking, online multilabel classification, and interval-valued prediction illustrate the practical impact of the theory by yielding sharp, dimension-dependent minimax guarantees.

Abstract

Paper Structure (35 sections, 15 theorems, 46 equations, 4 algorithms)

This paper contains 35 sections, 15 theorems, 46 equations, 4 algorithms.

Introduction
Related Works
Relation to List Online Classification
Preliminaries
Notation
Online Learning
Combinatorial Dimensions
Relations Between Combinatorial Dimensions
Realizable Setting
A Separation Between Deterministic and Randomized Learnability
Deterministic Learnability
Randomized Learnability
Agnostic Setting
Applications
Online Multilabel Ranking
...and 20 more sections

Key Result

Theorem 2

For $\mathcal{S}(\mathcal{Y}) \subseteq \sigma(\mathcal{Y})$ and $\mathcal{H} \subseteq \mathcal{Y}^{\mathcal{X}}$, we have

Theorems & Definitions (38)

Definition 1: Deterministic Online Learner
Definition 2: Randomized Online Learner
Definition 3: Online Agnostic Learnability
Definition 4: Online Realizable Learnability
Definition 5: Littlestone dimension Littlestone1987LearningQWDanielyERMprinciple
Definition 6: $p$-Set Littlestone dimension
Example 1
Definition 7: Set Littlestone dimension
Definition 8: Measure Shattering dimension
Definition 9: Helly Number of $\mathcal{S}(\mathcal{Y})$
...and 28 more

Online Learning with Set-Valued Feedback

TL;DR

Abstract

Online Learning with Set-Valued Feedback

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (38)