Sub-Image Recapture for Multi-View 3D Reconstruction
Yanwei Wang
TL;DR
This paper tackles the memory bottleneck in learning-based multi-view stereo when reconstructing high-resolution scenes. It introduces Sub-Image Recapture (SIR), a universal strategy that splits large images into sub-images, assigns each a synthesized recapture camera, and processes them with existing SfM/MVS pipelines while preserving native resolution. The approach reduces per-step memory, enables more overlap-driven depth estimates, and improves scalability and accuracy on large imagery, demonstrated on high-resolution airborne data using COLMAP and PatchMatchNet. The work enables applying state-of-the-art MVS to arbitrarily large images and provides an open-source implementation for practitioners.
Abstract
3D reconstruction of high-resolution target remains a challenge task due to the large memory required from the large input image size. Recently developed learning based algorithms provide promising reconstruction performance than traditional ones, however, they generally require more memory than the traditional algorithms and facing scalability issue. In this paper, we developed a generic approach, sub-image recapture (SIR), to split large image into smaller sub-images and process them individually. As a result of this framework, the existing 3D reconstruction algorithms can be implemented based on sub-image recapture with significantly reduced memory and substantially improved scalability
