Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI
Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang
TL;DR
The paper argues that Bayesian Deep Learning is essential in the age of large-scale AI because it provides principled uncertainty quantification, data efficiency, and adaptability to evolving domains, all of which are crucial for safe and reliable deployment of foundation models. It surveys the strengths and current challenges of BDL—posterior inference, priors, scalability, and foundation-model integration—and outlines concrete future directions, including novel posterior samplers, hybrid Bayesian approaches, deep kernel methods, semi/self-supervised Bayesian learning, probabilistic numerics, and compression. By linking these methodological advances to practical needs in uncertainty-aware decision-making, the authors advocate for integrating BDL with large models to unlock robust, trustworthy AI across domains. The discussion emphasizes the potential of BDL to enhance reliability and interpretability, while calling for scalable tooling, benchmarks, and application-driven development, particularly for foundation-model workflows.
Abstract
In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learning (BDL) constitutes a promising avenue, offering advantages across these diverse settings. This paper posits that BDL can elevate the capabilities of deep learning. It revisits the strengths of BDL, acknowledges existing challenges, and highlights some exciting research avenues aimed at addressing these obstacles. Looking ahead, the discussion focuses on possible ways to combine large-scale foundation models with BDL to unlock their full potential.
