$R^2$-Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement

Haoyang Wang; Liming Liu; Quanlu Jia; Jiangkai Wu; Haodan Zhang; Peiheng Wang; Xinggong Zhang

$R^2$-Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement

Haoyang Wang, Liming Liu, Quanlu Jia, Jiangkai Wu, Haodan Zhang, Peiheng Wang, Xinggong Zhang

TL;DR

This work addresses high-fidelity mesh reconstruction from multi-view images by coupling NeRF-based initialization with online reinforcement learning-guided viewpoint enrichment and differentiable mesh refinement. Stage 1 uses NeRF to generate a coarse $SDF$ and view-dependent appearance; Stage 2 employs an online UCB strategy to select NeRF-rendered viewpoints and jointly refine geometry and appearance through differentiable mesh extraction and rasterization, exporting the final mesh via the NeRF2Mesh workflow. The key contributions are a flexible two-stage refinement that updates both vertex positions and connectivity, an online UCB-based viewpoint enrichment technique that boosts rendering quality, and strong empirical results on NeRF-synthetic scenes showing improvements in both $CD$ and perceptual metrics such as $PSNR$, $SSIM$, and $LPIPS$. This approach advances high-fidelity geometry and rendering, with broad applicability to NeRF-based mesh reconstruction frameworks and potential impact on VR, medical imaging, and robotics workflows.

Abstract

Mesh reconstruction based on Neural Radiance Fields (NeRF) is popular in a variety of applications such as computer graphics, virtual reality, and medical imaging due to its efficiency in handling complex geometric structures and facilitating real-time rendering. However, existing works often fail to capture fine geometric details accurately and struggle with optimizing rendering quality. To address these challenges, we propose a novel algorithm that progressively generates and optimizes meshes from multi-view images. Our approach initiates with the training of a NeRF model to establish an initial Signed Distance Field (SDF) and a view-dependent appearance field. Subsequently, we iteratively refine the SDF through a differentiable mesh extraction method, continuously updating both the vertex positions and their connectivity based on the loss from mesh differentiable rasterization, while also optimizing the appearance representation. To further leverage high-fidelity and detail-rich representations from NeRF, we propose an online-learning strategy based on Upper Confidence Bound (UCB) to enhance viewpoints by adaptively incorporating images rendered by the initial NeRF model into the training dataset. Through extensive experiments, we demonstrate that our method delivers highly competitive and robust performance in both mesh rendering quality and geometric quality.

$R^2$-Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement

TL;DR

and view-dependent appearance; Stage 2 employs an online UCB strategy to select NeRF-rendered viewpoints and jointly refine geometry and appearance through differentiable mesh extraction and rasterization, exporting the final mesh via the NeRF2Mesh workflow. The key contributions are a flexible two-stage refinement that updates both vertex positions and connectivity, an online UCB-based viewpoint enrichment technique that boosts rendering quality, and strong empirical results on NeRF-synthetic scenes showing improvements in both

and perceptual metrics such as

, and

. This approach advances high-fidelity geometry and rendering, with broad applicability to NeRF-based mesh reconstruction frameworks and potential impact on VR, medical imaging, and robotics workflows.

Abstract

Paper Structure (19 sections, 8 equations, 5 figures, 4 tables)

This paper contains 19 sections, 8 equations, 5 figures, 4 tables.

Introduction
Related Work
Mesh Reconstruction from NeRF
Best Views Selection in 3d Scenes
Method
Framework
Efficient 3d Scene Initialization (Stage 1)
UCB-based Adaptive Viewpoint Enhancement (Stage 2)
Geometry and Appearance Refinement (Stage 2)
Loss Function
Regularization
Experiments
Implementation Details
Datasets
Evaluation
...and 4 more sections

Figures (5)

Figure 1: Visualization of our overall performance. Our method achieves highly competitive and robust performance in both mesh rendering quality and geometric quality.
Figure 2: Our Framework. In stage 1, we initialize the geometry and view-dependent appearance representation based on NeRF. This initial phase results in a coarse SDF grid and a set of candidate viewpoints rendered by NeRF model for enhancement in the subsequent stage. Then in stage 2, for each training iteration, we take two steps. We first select the optimal combination of viewpoints based on UCB strategy to incorporate into the training dataset. We then simultaneously refine both the geometry and the appearance representation. After training is complete, we obtain the final mesh.
Figure 3: Mesh reconstruction quality on NeRF-synthetic dataset. Our results significantly outperform the previous works.
Figure 4: Visualization of rendering quality. Our method can render finer details compared to the NeRF2Mesh's approach.
Figure 5: Fine mesh compared to the coarse mesh.

$R^2$-Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement

TL;DR

Abstract

$R^2$-Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement

Authors

TL;DR

Abstract

Table of Contents

Figures (5)