Learning Which Side to Scan: Multi-View Informed Active Perception with Side Scan Sonar for Autonomous Underwater Vehicles
Advaith V. Sethuraman, Philip Baldoni, Katherine A. Skinner, James McMahon
TL;DR
The paper tackles efficient underwater object classification with side-scan sonar by formulating Adaptive Surveying and Reacquisition (ASR) as a graph-based active perception problem. It introduces an Angular View Graph (AVG) to encode multiple viewing angles and relations, a Graph Multi-View ATR (GMVATR) for joint view-based recognition, and a Deep Q-Network policy to select the next best view, all evaluated in a photorealistic sonar simulator. Key contributions include the AVG formulation, a novel reward structure that minimizes the number of views while ensuring correct classification, and extensive experiments showing improved accuracy, coverage rate, and classification efficiency over state-of-the-art baselines. The approach promises more efficient autonomous missions in underwater exploration, archaeology, and environmental monitoring, with future work focusing on sim2real transfer and expanding the action space for more flexible reacquisition strategies.
Abstract
Autonomous underwater vehicles often perform surveys that capture multiple views of targets in order to provide more information for human operators or automatic target recognition algorithms. In this work, we address the problem of choosing the most informative views that minimize survey time while maximizing classifier accuracy. We introduce a novel active perception framework for multi-view adaptive surveying and reacquisition using side scan sonar imagery. Our framework addresses this challenge by using a graph formulation for the adaptive survey task. We then use Graph Neural Networks (GNNs) to both classify acquired sonar views and to choose the next best view based on the collected data. We evaluate our method using simulated surveys in a high-fidelity side scan sonar simulator. Our results demonstrate that our approach is able to surpass the state-of-the-art in classification accuracy and survey efficiency. This framework is a promising approach for more efficient autonomous missions involving side scan sonar, such as underwater exploration, marine archaeology, and environmental monitoring.
