Accelerating Convergence in Bayesian Few-Shot Classification
Tianjun Ke, Haoqun Cao, Feng Zhou
TL;DR
This paper tackles non-conjugate inference in Bayesian Gaussian-process-based few-shot classification by introducing Mirror Descent-based Variational Inference (MD-VI). It recasts VI updates on GP FSC into conjugate-like steps that exploit non-Euclidean geometry, achieving faster inner-loop convergence and parameterization invariance while preserving uncertainty quantification. The authors present a bi-level framework where task-specific VI updates occur in the inner loop and a deep-kernel GP prior is learned in the outer loop, with theoretical equivalence to natural gradient methods and a conjugate Bayesian interpretation. Empirical results show competitive accuracy and improved calibration across multiple FSC benchmarks, with systematic analysis of hyperparameters and convergence behavior. Overall, MD-BFSC offers a principled, efficient alternative for Bayesian meta-learning in few-shot classification, aided by publicly available code.
Abstract
Bayesian few-shot classification has been a focal point in the field of few-shot learning. This paper seamlessly integrates mirror descent-based variational inference into Gaussian process-based few-shot classification, addressing the challenge of non-conjugate inference. By leveraging non-Euclidean geometry, mirror descent achieves accelerated convergence by providing the steepest descent direction along the corresponding manifold. It also exhibits the parameterization invariance property concerning the variational distribution. Experimental results demonstrate competitive classification accuracy, improved uncertainty quantification, and faster convergence compared to baseline models. Additionally, we investigate the impact of hyperparameters and components. Code is publicly available at https://github.com/keanson/MD-BSFC.
