PA-MPPI: Perception-Aware Model Predictive Path Integral Control for Quadrotor Navigation in Unknown Environments
Yifan Zhai, Rudolf Reiter, Davide Scaramuzza
TL;DR
PA-MPPI tackles autonomous quadrotor navigation in unknown environments by augmenting Model Predictive Path Integral control with a perception-driven cost that steers trajectory samples toward informative frontiers. The approach tightly integrates a perception and mapping module (ROG-Map) with a ray-tracing frontier mechanism, enabling real-time, reference-free planning at 50 Hz. Hardware and simulated experiments show PA-MPPI performs on par with state-of-the-art planners like SUPER and serves as a robust action policy for navigation foundation models. Overall, the work advances perception-aware optimization for agile, safe navigation in cluttered, partially observed spaces and suggests avenues for longer-horizon planning and integration with learned planning systems.
Abstract
Quadrotor navigation in unknown environments is critical for practical missions such as search-and-rescue. Solving this problem requires addressing three key challenges: path planning in non-convex free space due to obstacles, satisfying quadrotor-specific dynamics and objectives, and exploring unknown regions to expand the map. Recently, the Model Predictive Path Integral (MPPI) method has emerged as a promising solution to the first two challenges. By leveraging sampling-based optimization, it can effectively handle non-convex free space while directly optimizing over the full quadrotor dynamics, enabling the inclusion of quadrotor-specific costs such as energy consumption. However, MPPI has been limited to tracking control that optimizes trajectories only within a small neighborhood around a reference trajectory, as it lacks the ability to explore unknown regions and plan alternative paths when blocked by large obstacles. To address this limitation, we introduce Perception-Aware MPPI (PA-MPPI). In this approach, perception-awareness is characterized by planning and adapting the trajectory online based on perception objectives. Specifically, when the goal is occluded, PA-MPPI incorporates a perception cost that biases trajectories toward those that can observe unknown regions. This expands the mapped traversable space and increases the likelihood of finding alternative paths to the goal. Through hardware experiments, we demonstrate that PA-MPPI, running at 50 Hz, performs on par with the state-of-the-art quadrotor navigation planner for unknown environments in challenging test scenarios. Furthermore, we show that PA-MPPI can serve as a safe and robust action policy for navigation foundation models, which often provide goal poses that are not directly reachable.
