SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping

Lingpeng Chen; Jiakun Tang; Apple Pui-Yi Chui; Ziyang Hong; Junfeng Wu

SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping

Lingpeng Chen, Jiakun Tang, Apple Pui-Yi Chui, Ziyang Hong, Junfeng Wu

TL;DR

The paper tackles robust 3D underwater reconstruction under challenging turbidity by fusing imaging sonar and camera data. It introduces SonarSweep, an end-to-end framework that adapts deep plane sweep to cross-modal fusion, back-projecting sonar features onto sonar-aligned planes and warping them into the camera view to build a multi-modal cost volume. Through extensive sim-to-real experiments, it demonstrates state-of-the-art dense depth accuracy and robustness across distance and turbidity, and releases a synchronized stereo-camera and imaging sonar dataset along with code. This approach holds practical significance for reliable autonomous underwater perception and mapping, especially in visually degraded environments.

Abstract

Accurate 3D reconstruction in visually-degraded underwater environments remains a formidable challenge. Single-modality approaches are insufficient: vision-based methods fail due to poor visibility and geometric constraints, while sonar is crippled by inherent elevation ambiguity and low resolution. Consequently, prior fusion technique relies on heuristics and flawed geometric assumptions, leading to significant artifacts and an inability to model complex scenes. In this paper, we introduce SonarSweep, a novel, end-to-end deep learning framework that overcomes these limitations by adapting the principled plane sweep algorithm for cross-modal fusion between sonar and visual data. Extensive experiments in both high-fidelity simulation and real-world environments demonstrate that SonarSweep consistently generates dense and accurate depth maps, significantly outperforming state-of-the-art methods across challenging conditions, particularly in high turbidity. To foster further research, we will publicly release our code and a novel dataset featuring synchronized stereo-camera and sonar data, the first of its kind.

SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping

TL;DR

Abstract

SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)