DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery
Jiadong Tang, Yu Gao, Dianyi Yang, Liqi Yan, Yufeng Yue, Yi Yang
TL;DR
DroneSplat addresses robust 3D reconstruction from in-the-wild drone imagery by tackling dynamic distractors and limited-view constraints. It combines Adaptive Local-Global Masking to detect and suppress dynamic regions with voxel-guided optimization that leverages dense priors from multi-view stereo (via DUSt3R) to constrain Gaussian primitives. The method integrates geometric priors, per-voxel constraints, and segmentation-guided masking to achieve high-quality static reconstructions in challenging wild scenes, outperforming NeRF-based and 3DGS baselines across dynamic and sparse-view scenarios. A drone-captured 24-sequence dataset further demonstrates the approach's practical value for real-world aerial reconstruction tasks, with implications for urban surveying and cultural heritage preservation.
Abstract
Drones have become essential tools for reconstructing wild scenes due to their outstanding maneuverability. Recent advances in radiance field methods have achieved remarkable rendering quality, providing a new avenue for 3D reconstruction from drone imagery. However, dynamic distractors in wild environments challenge the static scene assumption in radiance fields, while limited view constraints hinder the accurate capture of underlying scene geometry. To address these challenges, we introduce DroneSplat, a novel framework designed for robust 3D reconstruction from in-the-wild drone imagery. Our method adaptively adjusts masking thresholds by integrating local-global segmentation heuristics with statistical approaches, enabling precise identification and elimination of dynamic distractors in static scenes. We enhance 3D Gaussian Splatting with multi-view stereo predictions and a voxel-guided optimization strategy, supporting high-quality rendering under limited view constraints. For comprehensive evaluation, we provide a drone-captured 3D reconstruction dataset encompassing both dynamic and static scenes. Extensive experiments demonstrate that DroneSplat outperforms both 3DGS and NeRF baselines in handling in-the-wild drone imagery.
