GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Jingqiu Zhou, Lue Fan, Xuesong Chen, Linjiang Huang, Si Liu, Hongsheng Li
TL;DR
GaussianPainter addressess the non-uniqueness problem in 3D Gaussian Splatting by introducing normal-guided rotations and a multi-scale triplane appearance injection, enabling efficient feed-forward painting of point clouds into high-quality 3D Gaussians from a reference image. The method achieves state-of-the-art novel-view synthesis and cross-object appearance transfer on OmniObject3D and Objaverse while maintaining efficiency with a single forward pass. By constraining the Gaussian parameter space via predicted normals and bridging 2D-3D appearance through triplanes, GaussianPainter enables robust, diverse 3D content creation from point clouds. The work paves the way for scalable, controllable 3D content generation, with potential extensions to scene-scale data and text-driven appearance control.
Abstract
In this paper, we present GaussianPainter, the first method to paint a point cloud into 3D Gaussians given a reference image. GaussianPainter introduces an innovative feed-forward approach to overcome the limitations of time-consuming test-time optimization in 3D Gaussian splatting. Our method addresses a critical challenge in the field: the non-uniqueness problem inherent in the large parameter space of 3D Gaussian splatting. This space, encompassing rotation, anisotropic scales, and spherical harmonic coefficients, introduces the challenge of rendering similar images from substantially different Gaussian fields. As a result, feed-forward networks face instability when attempting to directly predict high-quality Gaussian fields, struggling to converge on consistent parameters for a given output. To address this issue, we propose to estimate a surface normal for each point to determine its Gaussian rotation. This strategy enables the network to effectively predict the remaining Gaussian parameters in the constrained space. We further enhance our approach with an appearance injection module, incorporating reference image appearance into Gaussian fields via a multiscale triplane representation. Our method successfully balances efficiency and fidelity in 3D Gaussian generation, achieving high-quality, diverse, and robust 3D content creation from point clouds in a single forward pass.
