Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping
Minseong Park, Suhan Woo, Euntai Kim
TL;DR
DNMap addresses memory bottlenecks in large-scale outdoor 3D mapping by replacing full continuous embeddings with a decomposition-based discrete representation whose embedding space is factorized into a shared bias embedding $e_b$ and $B$ offsets $\Delta{\mathbf{e}}_j$, composed through binary indicators $\mathbf{b}$ and stored via an octree feature volume. It augments these discrete embeddings with a low-resolution continuous embedding $c$ to provide global location cues, and uses a shallow decoder to predict the signed distance $d=\Phi(\mathbf{x})$, trained with a BCE-based $L_{sdf}$ loss and the Eikonal regularization $L_e$. Experiments on MaiCity and Newer College show substantial storage reductions while preserving or improving mapping quality compared with SHINE-Mapping and VQ-SHINE-Mapping, and DNMap remains trainable under memory budgets where baselines fail. By focusing on learning composition indicators rather than indexing the entire embedding space, DNMap achieves robust, scalable, memory-efficient large-scale 3D maps for outdoor environments.
Abstract
Learning efficient representations of local features is a key challenge in feature volume-based 3D neural mapping, especially in large-scale environments. In this paper, we introduce Decomposition-based Neural Mapping (DNMap), a storage-efficient large-scale 3D mapping method that employs a discrete representation based on a decomposition strategy. This decomposition strategy aims to efficiently capture repetitive and representative patterns of shapes by decomposing each discrete embedding into component vectors that are shared across the embedding space. Our DNMap optimizes a set of component vectors, rather than entire discrete embeddings, and learns composition rather than indexing the discrete embeddings. Furthermore, to complement the mapping quality, we additionally learn low-resolution continuous embeddings that require tiny storage space. By combining these representations with a shallow neural network and an efficient octree-based feature volume, our DNMap successfully approximates signed distance functions and compresses the feature volume while preserving mapping quality. Our source code is available at https://github.com/minseong-p/dnmap.
