Navigating the Wild: Pareto-Optimal Visual Decision-Making in Image Space
Durgakant Pushp, Weizhe Chen, Zheng Chen, Chaomin Luo, Jason M. Gregory, Lantao Liu
TL;DR
The paper addresses the challenge of robust navigation in cluttered real-world environments without relying on explicit maps or heavy training data. It proposes Pareto-Optimal Visual Navigation (POVNav), a two-module framework that uses semantic segmentation to build a navigability image and an image-space planner that selects a Horizon Optic Goal (HOG) from the visual horizon via Pareto-based scalarization, followed by visual servoing. The authors provide theoretical guarantees (weak Pareto optimality and local stability) and analyze computational complexity, complemented by extensive simulations and real-world experiments across indoor/outdoor, static/dynamic, and adverse conditions. The work demonstrates improved success rates and shorter paths than baselines, with robust performance under segmentation noise and domain shifts, and includes practical details for real-time deployment and a release of code. The approach offers a scalable, modular alternative to fully map-based or purely end-to-end methods, enabling reliable, low-compute navigation in varied environments with potentially dynamic obstacles.
Abstract
Navigating complex real-world environments requires semantic understanding and adaptive decision-making. Traditional reactive methods without maps often fail in cluttered settings, map-based approaches demand heavy mapping effort, and learning-based solutions rely on large datasets with limited generalization. To address these challenges, we present Pareto-Optimal Visual Navigation, a lightweight image-space framework that combines data-driven semantics, Pareto-optimal decision-making, and visual servoing for real-time navigation.
