Mixed Likelihood Variational Gaussian Processes
Kaiwen Wu, Craig Sanders, Benjamin Letham, Phillip Guan
TL;DR
Gaussian processes enable well-calibrated uncertainty, but standard GP models struggle to ingest diverse auxiliary data. The authors introduce mixed likelihood variational GPs to jointly model multiple data types from a single latent function by maximizing an ELBO that sums over likelihoods: $\log p(\mathbf{y}|\mathbf{f}) = \sum_{t=1}^T \log p_t(\mathbf{y}^{(t)}|\mathbf{f}^{(t)})$ with a KL term for $q(\mathbf{u})$. They demonstrate practical gains in active learning and preference learning across visual, haptic, and robotic domains, using both synthetic and real-world data and a novel Likert-scale likelihood for confidence ratings. The results show that incorporating auxiliary information via mixed likelihoods improves data efficiency and predictive performance in human-in-the-loop experiments.
Abstract
Gaussian processes (GPs) are powerful models for human-in-the-loop experiments due to their flexibility and well-calibrated uncertainty. However, GPs modeling human responses typically ignore auxiliary information, including a priori domain expertise and non-task performance information like user confidence ratings. We propose mixed likelihood variational GPs to leverage auxiliary information, which combine multiple likelihoods in a single evidence lower bound to model multiple types of data. We demonstrate the benefits of mixing likelihoods in three real-world experiments with human participants. First, we use mixed likelihood training to impose prior knowledge constraints in GP classifiers, which accelerates active learning in a visual perception task where users are asked to identify geometric errors resulting from camera position errors in virtual reality. Second, we show that leveraging Likert scale confidence ratings by mixed likelihood training improves model fitting for haptic perception of surface roughness. Lastly, we show that Likert scale confidence ratings improve human preference learning in robot gait optimization. The modeling performance improvements found using our framework across this diverse set of applications illustrates the benefits of incorporating auxiliary information into active learning and preference learning by using mixed likelihoods to jointly model multiple inputs.
