2D Gaussians Spatial Transport for Point-supervised Density Regression
Miao Shang, Xiaopeng Hong
TL;DR
The paper addresses the computational bottleneck of optimal transport (OT) in point-supervised density regression by introducing Gaussian Spatial Transport (GST). GST builds a fixed, precomputable transport kernel from the input image using 2D Gaussian Splatting, enabling the network to be trained via a single matrix multiplication to push the predicted density to the annotation space. A Bayesian transport-based loss with the kernel, L_BT = || K' ilde{\zeta}_d - \zeta_g ||_1, replaces iterative OT optimization, greatly reducing training time while maintaining or improving accuracy. The method is demonstrated on crowd counting and landmark localization, showing competitive or state-of-the-art performance and substantial efficiency gains, with ablations validating the contribution of deformity elimination and background correspondence. Code is provided to reproduce the GST pipeline.
Abstract
This paper introduces Gaussian Spatial Transport (GST), a novel framework that leverages Gaussian splatting to facilitate transport from the probability measure in the image coordinate space to the annotation map. We propose a Gaussian splatting-based method to estimate pixel-annotation correspondence, which is then used to compute a transport plan derived from Bayesian probability. To integrate the resulting transport plan into standard network optimization in typical computer vision tasks, we derive a loss function that measures discrepancy after transport. Extensive experiments on representative computer vision tasks, including crowd counting and landmark detection, validate the effectiveness of our approach. Compared to conventional optimal transport schemes, GST eliminates iterative transport plan computation during training, significantly improving efficiency. Code is available at https://github.com/infinite0522/GST.
