Streaming Gaussian Dirichlet Random Fields for Spatial Predictions of High Dimensional Categorical Observations
J. E. San Soucie, H. M. Sosik, Y. Girdhar
TL;DR
This work addresses the challenge of predicting and planning with streaming, spatiotemporal, sparse, high-dimensional categorical observations in autonomous sensing. It introduces the Streaming Gaussian Dirichlet Random Field (S-GDRF), a streaming extension that combines Gaussian-process–driven spatial priors with Dirichlet latent communities over observation categories, enabling interpolation and planning. A novel subsampling-based streaming BBVI with a sparse inducing-point approximation achieves bounded-time, linear-space inference, with complexity $O(n_s m^3)$. Empirical results on plankton imagery and reef imagery show that S-GDRF outperforms a single GP per category (VGP), scales to thousands of categories, and delivers real-time inference suitable for onboard deployment, thereby enabling informative path planning over high-dimensional categorical observations.
Abstract
We present the Streaming Gaussian Dirichlet Random Field (S-GDRF) model, a novel approach for modeling a stream of spatiotemporally distributed, sparse, high-dimensional categorical observations. The proposed approach efficiently learns global and local patterns in spatiotemporal data, allowing for fast inference and querying with a bounded time complexity. Using a high-resolution data series of plankton images classified with a neural network, we demonstrate the ability of the approach to make more accurate predictions compared to a Variational Gaussian Process (VGP), and to learn a predictive distribution of observations from streaming categorical data. S-GDRFs open the door to enabling efficient informative path planning over high-dimensional categorical observations, which until now has not been feasible.
