LoGoColor: Local-Global 3D Colorization for 360° Scenes
Yeonjin Chang, Juhwan Cho, Seunghyeon Seo, Wonsik Shin, Nojun Kwak
TL;DR
The paper tackles the problem of colorizing geometry-only 3D reconstructions, especially for 360° scenes, where prior methods rely on averaging 2D color model outputs and lose color diversity. It introduces LoGoColor, a Local-Global pipeline that first reconstructs single-channel geometry, decomposes the scene into subscenes, and then uses a fine-tuned multi-view diffusion model to enforce both inter-subscene and intra-subscene color consistency, followed by local color propagation. A novel Color Diversity Index (CDI) is proposed to quantify color richness, and extensive experiments on LLFF, Mip-NeRF 360, and Tanks and Temples show improved color diversity with competitive consistency compared to ColorNeRF and ChromaDistill, including applications to thermal data. The approach demonstrates robust multi-view colorization for complex 360° scenes, enabling richer visualizations for VR/AR and related applications.
Abstract
Single-channel 3D reconstruction is widely used in fields such as robotics and medical imaging. While this line of work excels at reconstructing 3D geometry, the outputs are not colored 3D models, thus 3D colorization is required for visualization. Recent 3D colorization studies address this problem by distilling 2D image colorization models. However, these approaches suffer from an inherent inconsistency of 2D image models. This results in colors being averaged during training, leading to monotonous and oversimplified results, particularly in complex 360° scenes. In contrast, we aim to preserve color diversity by generating a new set of consistently colorized training views, thereby bypassing the averaging process. Nevertheless, eliminating the averaging process introduces a new challenge: ensuring strict multi-view consistency across these colorized views. To achieve this, we propose LoGoColor, a pipeline designed to preserve color diversity by eliminating this guidance-averaging process with a `Local-Global' approach: we partition the scene into subscenes and explicitly tackle both inter-subscene and intra-subscene consistency using a fine-tuned multi-view diffusion model. We demonstrate that our method achieves quantitatively and qualitatively more consistent and plausible 3D colorization on complex 360° scenes than existing methods, and validate its superior color diversity using a novel Color Diversity Index.
