PACER: Preference-conditioned All-terrain Costmap Generation
Luisa Mao, Garrett Warnell, Peter Stone, Joydeep Biswas
TL;DR
PACER addresses the challenge of rapidly adapting terrain costs to operator preferences by learning a preference-conditioned BEV costmap from a deployment-time preference context. It casts costmap generation as learning a function $R:\mathcal{I}\times\mathcal{H}\to\mathcal{C}$ that aligns with the unknown human cost $H:\mathcal{T}\to\mathbb{R}$ under equivalence and partial-ordering constraints, enabling navigation in $SE(2)$. The authors propose a two-encoder encoder–decoder network that processes a BEV image and a structured preference context $\hat{H}$, trained with pixel-wise loss across real, shuffled, and synthetic data in three phases. Empirical results from aerial-map simulations and real-robot deployments show that PACER generalizes to unseen terrains, adapts to new preferences without fine-tuning, and outperforms semantics-based and representation-learning baselines in producing navigation-friendly costmaps.
Abstract
In autonomous robot navigation, terrain cost assignment is typically performed using a semantics-based paradigm in which terrain is first labeled using a pre-trained semantic classifier and costs are then assigned according to a user-defined mapping between label and cost. While this approach is rapidly adaptable to changing user preferences, only preferences over the types of terrain that are already known by the semantic classifier can be expressed. In this paper, we hypothesize that a machine-learning-based alternative to the semantics-based paradigm above will allow for rapid cost assignment adaptation to preferences expressed over new terrains at deployment time without the need for additional training. To investigate this hypothesis, we introduce and study PACER, a novel approach to costmap generation that accepts as input a single birds-eye view (BEV) image of the surrounding area along with a user-specified preference context and generates a corresponding BEV costmap that aligns with the preference context. Using both real and synthetic data along with a combination of proposed training tasks, we find that PACER is able to adapt quickly to new user preferences while also exhibiting better generalization to novel terrains compared to both semantics-based and representation-learning approaches.
