Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection
Hongru Yan, Yu Zheng, Yueqi Duan
TL;DR
This paper tackles RGB-based 3D object detection by shifting from discrete point or NeRF representations to a continuous surface model built from 3D Gaussian Splatting. It introduces Gaussian-Det, which encodes objects as a mass of surface-describing Gaussians and refines proposals via a Closure Inference Module that jointly handles partial surface uncertainty with a variational residual and holistic surface closure quantified through a flux $|\hat{\boldsymbol{\Phi}}|$. The CIM produces a probabilistic, closure-aware holistic representation that serves as a prior to improve detection reliability, yielding superior AP and AR on indoor datasets (3D-FRONT and ScanNet) while maintaining real-time performance. The approach demonstrates strong robustness to outliers and noisy poses, and it broadens the use of surface-based priors for 3D perception tasks with potential extensions to open-vocabulary 3D instance segmentation.
Abstract
Skins wrapping around our bodies, leathers covering over the sofa, sheet metal coating the car - it suggests that objects are enclosed by a series of continuous surfaces, which provides us with informative geometry prior for objectness deduction. In this paper, we propose Gaussian-Det which leverages Gaussian Splatting as surface representation for multi-view based 3D object detection. Unlike existing monocular or NeRF-based methods which depict the objects via discrete positional data, Gaussian-Det models the objects in a continuous manner by formulating the input Gaussians as feature descriptors on a mass of partial surfaces. Furthermore, to address the numerous outliers inherently introduced by Gaussian splatting, we accordingly devise a Closure Inferring Module (CIM) for the comprehensive surface-based objectness deduction. CIM firstly estimates the probabilistic feature residuals for partial surfaces given the underdetermined nature of Gaussian Splatting, which are then coalesced into a holistic representation on the overall surface closure of the object proposal. In this way, the surface information Gaussian-Det exploits serves as the prior on the quality and reliability of objectness and the information basis of proposal refinement. Experiments on both synthetic and real-world datasets demonstrate that Gaussian-Det outperforms various existing approaches, in terms of both average precision and recall.
