Learning Agile Swimming: An End-to-End Approach without CPGs
Xiaozhu Lin, Xiaopei Liu, Yang Wang
TL;DR
The paper addresses the challenge of achieving agile, energy-efficient swimming in bio-mimetic robotic fish by proposing a model-free, end-to-end DRL framework that directly outputs low-level actuator commands without relying on Central Pattern Generators (CPGs). Training occurs in a high-performance CFD-based FishGym simulator with sim-to-real calibration techniques, enabling zero-shot transfer of policies to real hardware. Key contributions include eliminating the need for dynamic models or predefined gaits, demonstrating superior speed and maneuverability with reduced energy, and validating zero-shot transfer on challenging tasks such as a 180-degree turn and pentagram waypoint tracking. The approach promises practical impact for deploying robotic fish in real aquatic environments by substantially narrowing the sim-to-real gap and simplifying controller design for fluid-structure interaction systems.
Abstract
The pursuit of agile and efficient underwater robots, especially bio-mimetic robotic fish, has been impeded by challenges in creating motion controllers that are able to fully exploit their hydrodynamic capabilities. This paper addresses these challenges by introducing a novel, model-free, end-to-end control framework that leverages Deep Reinforcement Learning (DRL) to enable agile and energy-efficient swimming of robotic fish. Unlike existing methods that rely on predefined trigonometric swimming patterns like Central Pattern Generators (CPG), our approach directly outputs low-level actuator commands without strong constraints, enabling the robotic fish to learn agile swimming behaviors. In addition, by integrating a high-performance Computational Fluid Dynamics (CFD) simulator with innovative sim-to-real strategies, such as normalized density calibration and servo response calibration, the proposed framework significantly mitigates the sim-to-real gap, facilitating direct transfer of control policies to real-world environments without fine-tuning. Comparative experiments demonstrate that our method achieves faster swimming speeds, smaller turn-around radii, and reduced energy consumption compared to the state-of-the-art swimming controllers. Furthermore, the proposed framework shows promise in addressing complex tasks, paving the way for more effective deployment of robotic fish in real aquatic environments.
