MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung, Jaehoon Choi, Yonghan Lee, Sungmin Eum, Heesung Kwon, Dinesh Manocha
TL;DR
MoRe introduces a training-free monocular geometry refinement that achieves cross-view consistency and scale alignment for point maps produced by monocular 3D foundation models. It first performs an affine alignment using inter-view correspondences, then refines the result with a graph-based locally planar optimization that jointly optimizes 3D points and surface normals. This approach preserves the underlying scene structure while mitigating monocular scale ambiguities and improves novel-view synthesis in sparse-view regimes. Across standard benchmarks, MoRe delivers competitive multi-view depth and 3D reconstruction performance and notable gains in cross-view coherence and rendering quality.
Abstract
Monocular 3D foundation models offer an extensible solution for perception tasks, making them attractive for broader 3D vision applications. In this paper, we propose MoRe, a training-free Monocular Geometry Refinement method designed to improve cross-view consistency and achieve scale alignment. To induce inter-frame relationships, our method employs feature matching between frames to establish correspondences. Rather than applying simple least squares optimization on these matched points, we formulate a graph-based optimization framework that performs local planar approximation using the estimated 3D points and surface normals estimated by monocular foundation models. This formulation addresses the scale ambiguity inherent in monocular geometric priors while preserving the underlying 3D structure. We further demonstrate that MoRe not only enhances 3D reconstruction but also improves novel view synthesis, particularly in sparse view rendering scenarios.
