Learning Unified Representation of 3D Gaussian Splatting
Yuelin Xin, Yuheng Liu, Xiaohui Xie, Xinke Li
TL;DR
The paper tackles the challenge of learning 3D Gaussian Splatting by criticizing the native parametric space for non-uniqueness, numerical heterogeneity, and incompatible manifold structure. It introduces a geometry-aware submanifold-field representation, mapping each Gaussian primitive to a color field on its iso-probability surface and enforcing a unique correspondence to the underlying radiance field. A Submanifold Field Variational Autoencoder (SF-VAE) is developed to encode these fields into compact embeddings, with a Wasserstein-2 based Manifold Distance guiding learning to align perceptual quality rather than parameter distance. Across ShapeSplat and Mip-NeRF 360 datasets, the SF-based embeddings yield higher fidelity reconstructions, stronger cross-domain generalization, and more stable latent spaces, with a Gaussian Neural Field demonstrating improved learnability when conditioned on SF embeddings. The findings suggest that geometry-aware embeddings are a robust learning target for 3D Gaussian Splatting and pave the way for diffusion-inspired generation, compression, and downstream neural-field applications.
Abstract
A well-designed vectorized representation is crucial for the learning systems natively based on 3D Gaussian Splatting. While 3DGS enables efficient and explicit 3D reconstruction, its parameter-based representation remains hard to learn as features, especially for neural-network-based models. Directly feeding raw Gaussian parameters into learning frameworks fails to address the non-unique and heterogeneous nature of the Gaussian parameterization, yielding highly data-dependent models. This challenge motivates us to explore a more principled approach to represent 3D Gaussian Splatting in neural networks that preserves the underlying color and geometric structure while enforcing unique mapping and channel homogeneity. In this paper, we propose an embedding representation of 3DGS based on continuous submanifold fields that encapsulate the intrinsic information of Gaussian primitives, thereby benefiting the learning of 3DGS. Implementation available at https://github.com/cilix-ai/gs-embedding
