Accelerating Non-Conjugate Gaussian Processes By Trading Off Computation For Uncertainty
Lukas Tatzel, Jonathan Wenger, Frank Schneider, Philipp Hennig
TL;DR
This paper tackles the scalability bottleneck of non-conjugate Gaussian processes by explicitly modeling the uncertainty introduced by approximate inference. It introduces IterNCGP, a computation-aware framework that treats Newton steps in Laplace-based inference as a sequence of GP regressions solved by an inner probabilistic linear solver (IterGP), producing a tunable trade-off between computation and uncertainty. Key contributions include policy-driven targeted computations (SoD vs CG), a recycling mechanism to reuse prior computations across Newton steps, and a memory-efficient compression strategy to bound resource use. Experiments on Poisson regression and large-scale GP multiclass classification demonstrate substantial speedups over strong baselines, with competitive predictive performance and calibrated uncertainty.
Abstract
Non-conjugate Gaussian processes (NCGPs) define a flexible probabilistic framework to model categorical, ordinal and continuous data, and are widely used in practice. However, exact inference in NCGPs is prohibitively expensive for large datasets, thus requiring approximations in practice. The approximation error adversely impacts the reliability of the model and is not accounted for in the uncertainty of the prediction. We introduce a family of iterative methods that explicitly model this error. They are uniquely suited to parallel modern computing hardware, efficiently recycle computations, and compress information to reduce both the time and memory requirements for NCGPs. As we demonstrate on large-scale classification problems, our method significantly accelerates posterior inference compared to competitive baselines by trading off reduced computation for increased uncertainty.
