Reward-Based Collision-Free Algorithm for Trajectory Planning of Autonomous Robots
Jose D. Hoyos, Tianyu Zhou, Zehui Lu, Shaoshuai Mou
TL;DR
The paper tackles reward-based mission planning for autonomous robots where a sequence of weighted waypoints must be chosen to maximize collected rewards under obstacle avoidance and robot dynamics. It introduces a two-layer framework: a genetic algorithm that optimizes waypoint sequences with a penalty-based fitness function, and a trajectory generator using differential flatness and clothoid curves to ensure dynamically feasible, curvature-continuous paths between waypoints; obstacle-free subpaths are established via A*, and the genetic crossover leverages dynamic time warping with extended convex combinations and projection. Key contributions include a two-strategy crossover for variable-length discrete sequences, a differential-flatness-based mapping from flat outputs to states/inputs, constraint-violation metrics integrated into the fitness, and extensive hardware validation on a ground vehicle, a quadrotor, and a quadruped, plus a time-complexity study showing empirical $O(n^2)$ scaling. The results demonstrate improved scalability and trajectory smoothness over jerk/snap-based methods and competitive performance against MINLP solvers, highlighting practical impact for real-world autonomous tasks with limited resources and varying waypoint rewards. The framework supports flexible task prioritization, robust collision avoidance, and efficient planning suitable for embedded deployment, with future directions including dynamic obstacles, multi-robot extensions, and real-time re-planning.
Abstract
This paper proposes a novel mission planning algorithm for autonomous robots that selects an optimal waypoint sequence from a predefined set to maximize total reward while satisfying obstacle avoidance, state, input, derivative, mission time, and distance constraints. The formulation extends the prize-collecting traveling salesman problem. A tailored genetic algorithm evolves candidate solutions using a fitness function, crossover, and mutation, with constraint enforcement via a penalty method. Differential flatness and clothoid curves are employed to penalize infeasible trajectories efficiently, while the Euler spiral method ensures curvature-continuous trajectories with bounded curvature, enhancing dynamic feasibility and mitigating oscillations typical of minimum-jerk and snap parameterizations. Due to the discrete variable length optimization space, crossover is performed using a dynamic time-warping-based method and extended convex combination with projection. The algorithm's performance is validated through simulations and experiments with a ground vehicle, quadrotor, and quadruped, supported by benchmarking and time-complexity analysis.
