Online Learning with Improving Agents: Multiclass, Budgeted Agents and Bandit Learners
Sajad Ashkezari, Shai Ben-David
TL;DR
This work extensively extends previously published results by providing combinatorial dimensions that characterize online learnability in this model, by analyzing the multiclass setup, learnability in a bandit feedback setup, modeling agents'cost for making improvements and more.
Abstract
We investigate the recently introduced model of learning with improvements, where agents are allowed to make small changes to their feature values to be warranted a more desirable label. We extensively extend previously published results by providing combinatorial dimensions that characterize online learnability in this model, by analyzing the multiclass setup, learnability in a bandit feedback setup, modeling agents' cost for making improvements and more.
