Terrain Costmap Generation via Scaled Preference Conditioning
Luisa Mao, Garret Warnell, Peter Stone, Joydeep Biswas
TL;DR
SPACER addresses the dual challenge of generalizing terrain costmaps to unseen environments while enabling rapid test-time control of relative terrain costs. It achieves this by conditioning costmap generation on a scaled preference context and training on a synthetic dataset, using a latent UNet architecture and a pretrained Stable Diffusion VAE, with an inverse Bradley–Terry formulation to link preference strength to cost differences. The approach yields superior planning performance and costmap accuracy across diverse aerial environments, outperforming semantic segmentation and prior PbIRL baselines in most settings, and enabling fine-grained operator control without retraining. This has practical impact for autonomous off-road navigation, as operators can rapidly adapt to mission goals by adjusting preference strengths, with robust generalization to novel terrains.
Abstract
Successful autonomous robot navigation in off-road domains requires the ability to generate high-quality terrain costmaps that are able to both generalize well over a wide variety of terrains and rapidly adapt relative costs at test time to meet mission-specific needs. Existing approaches for costmap generation allow for either rapid test-time adaptation of relative costs (e.g., semantic segmentation methods) or generalization to new terrain types (e.g., representation learning methods), but not both. In this work, we present scaled preference conditioned all-terrain costmap generation (SPACER), a novel approach for generating terrain costmaps that leverages synthetic data during training in order to generalize well to new terrains, and allows for rapid test-time adaptation of relative costs by conditioning on a user-specified scaled preference context. Using large-scale aerial maps, we provide empirical evidence that SPACER outperforms other approaches at generating costmaps for terrain navigation, with the lowest measured regret across varied preferences in five of seven environments for global path planning.
