Transductive Active Learning: Theory and Applications
Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause
TL;DR
This work extends active learning to transductive settings where sampling is restricted to a sample space $\mathcal{S}$ while predictions target a broader space $\mathcal{A}$. It develops a principled theory showing convergence of uncertainty about $\boldsymbol{f}_{\mathcal{A}}$ to the irreducible limit using GP/RKHS models, and demonstrates that sampling to minimize posterior uncertainty yields superior sample efficiency compared to traditional uncertainty sampling. The authors instantiate this framework in two practical domains: active fine-tuning of large neural networks and safe Bayesian optimization, achieving state-of-the-art or competitive performance while respecting safety constraints. They also propose batch strategies via conditional embeddings to maintain diversity and relevance during data selection, and provide extensive experiments on MNIST/CIFAR-100 and a quadcopter control problem. Overall, TAL offers a flexible, info-theoretic pathway for directed learning with provable guarantees and strong empirical impact across real-world domains.
Abstract
We study a generalization of classical active learning to real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such decision rules converge uniformly to the smallest possible uncertainty obtainable from the accessible data. We demonstrate their strong sample efficiency in two key applications: active fine-tuning of large neural networks and safe Bayesian optimization, where they achieve state-of-the-art performance.
