Experimental investigation of pose informed reinforcement learning for skid-steered visual navigation
Ameya Salvi, Venkat Krovi
TL;DR
This work tackles vision-based lane keeping for skid-steered robots by introducing a pose-informed DRL framework that uses waypoint-guided learning with arc-length clothoid paths. It formalizes the problem as an MDP with a structured track, a clothoid-based reference path, and a reward that jointly penalizes pose, velocity, and action effort; learning can be conducted in IK or end-to-end modes. Through extensive simulations and hardware experiments, it demonstrates that waypoint-guided learning yields robust sim-to-real transfer, outperforms several formal methods on high-curvature tracks, and generalizes beyond cone markers to unseen visual features. The study also analyzes the effects of waypoint spacing, look-ahead adjustments, and sensor dropouts, offering practical insights for deploying pose-informed policies in real autonomy stacks.
Abstract
Vision-based lane keeping is a topic of significant interest in the robotics and autonomous ground vehicles communities in various on-road and off-road applications. The skid-steered vehicle architecture has served as a useful vehicle platform for human controlled operations. However, systematic modeling, especially of the skid-slip wheel terrain interactions (primarily in off-road settings) has created bottlenecks for automation deployment. End-to-end learning based methods such as imitation learning and deep reinforcement learning, have gained prominence as a viable deployment option to counter the lack of accurate analytical models. However, the systematic formulation and subsequent verification/validation in dynamic operation regimes (particularly for skid-steered vehicles) remains a work in progress. To this end, a novel approach for structured formulation for learning visual navigation is proposed and investigated in this work. Extensive software simulations, hardware evaluations and ablation studies now highlight the significantly improved performance of the proposed approach against contemporary literature.
