360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images
Zhongmiao Yan, Qi Wu, Songpengcheng Xia, Junyuan Deng, Xiang Mu, Renbiao Jin, Ling Pei
TL;DR
360Recon introduces an ERP-aware multi-view reconstruction framework that tackles distortions in 360° panoramas via a Spherical Feature Extractor and Spherical Sweeping to build a discriminative 4D cost volume. A multi-scale Encoder–Decoder along with ERP-adapted 360° TSDF-style depth fusion yields high-precision depth maps and coherent 3D reconstructions on real-world datasets, without relying on heavy 3D CNNs. The approach achieves state-of-the-art depth estimation and 3D reconstruction across Matterport3D, Stanford2D3D, and OmniScenes, while maintaining efficient runtime and memory utilization. Limitations include occluded-region reconstruction, with future work planned around generative completion and sparse representations to improve completeness and efficiency.
Abstract
360-degree images offer a significantly wider field of view compared to traditional pinhole cameras, enabling sparse sampling and dense 3D reconstruction in low-texture environments. This makes them crucial for applications in VR, AR, and related fields. However, the inherent distortion caused by the wide field of view affects feature extraction and matching, leading to geometric consistency issues in subsequent multi-view reconstruction. In this work, we propose 360Recon, an innovative MVS algorithm for ERP images. The proposed spherical feature extraction module effectively mitigates distortion effects, and by combining the constructed 3D cost volume with multi-scale enhanced features from ERP images, our approach achieves high-precision scene reconstruction while preserving local geometric consistency. Experimental results demonstrate that 360Recon achieves state-of-the-art performance and high efficiency in depth estimation and 3D reconstruction on existing public panoramic reconstruction datasets.
