Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models
Jerry Yan, Chinmay Talegaonkar, Nicholas Antipa, Eric Terrill, Sophia Merrifield
TL;DR
This work tackles the challenge of estimating the pose and burial fraction of buried seabed barrels from ROV imagery in the San Pedro Basin. It proposes a learning-based pipeline that combines underwater 3D reconstruction (DUSt3R) with segmentation (Grounding DINO + SAM) and BarrelNet, a modified PointNet, to predict the barrel axis $\vec{\mathbf{n}}$, radius $r$, and centroid $\boldsymbol{\mathbf{c}}$, with burial fraction $b_f$ computed via Monte Carlo sampling. BarrelNet is trained exclusively on synthetically generated, occluded cylinder point clouds to model burial effects and is shown to dramatically outperform classical cylinder fitting in synthetic tests, with qualitative transfer to real ROV data. The framework enables robust quantification of buried debris impact on marine environments and provides a basis for extending pose estimation to other underwater objects in future work.
Abstract
We present an approach for pose and burial fraction estimation of debris field barrels found on the seabed in the Southern California San Pedro Basin. Our computational workflow leverages recent advances in foundation models for segmentation and a vision transformer-based approach to estimate the point cloud which defines the geometry of the barrel. We propose BarrelNet for estimating the 6-DOF pose and radius of buried barrels from the barrel point clouds as input. We train BarrelNet using synthetically generated barrel point clouds, and qualitatively demonstrate the potential of our approach using remotely operated vehicle (ROV) video footage of barrels found at a historic dump site. We compare our method to a traditional least squares fitting approach and show significant improvement according to our defined benchmarks.
