A Deep Reinforcement Learning Framework and Methodology for Reducing the Sim-to-Real Gap in ASV Navigation
Luis F W Batista, Junghwan Ro, Antoine Richard, Pete Schroepfer, Seth Hutchinson, Cedric Pradalier
TL;DR
The paper tackles the sim-to-real gap in DRL-controlled ASVs by integrating buoyancy and hydrodynamics into a GPU-accelerated RL framework and by employing system identification with domain randomization to improve real-world transfer. Through both simulation and real-world experiments on floating-waste capture, the authors show that SID-DR reduces energy consumption by 13.1% and accelerates task completion by 7.4% compared to nominal approaches, underscoring the practical benefits of more accurate hydrodynamic modeling and robust training. The contributions include an open-source, parallelizable simulation platform and a demonstrated methodology for aligning simulated dynamics with real-world behavior, enhancing efficiency and versatility of autonomous marine operations. These results have potential environmental and operational impact by enabling more capable and energy-efficient ASVs for cleaning and monitoring aquatic environments.
Abstract
Despite the increasing adoption of Deep Reinforcement Learning (DRL) for Autonomous Surface Vehicles (ASVs), there still remain challenges limiting real-world deployment. In this paper, we first integrate buoyancy and hydrodynamics models into a modern Reinforcement Learning framework to reduce training time. Next, we show how system identification coupled with domain randomization improves the RL agent performance and narrows the sim-to-real gap. Real-world experiments for the task of capturing floating waste show that our approach lowers energy consumption by 13.1\% while reducing task completion time by 7.4\%. These findings, supported by sharing our open-source implementation, hold the potential to impact the efficiency and versatility of ASVs, contributing to environmental conservation efforts.
