Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation
Yiyuan Pan, Yunzhe Xu, Zhe Liu, Hesheng Wang
TL;DR
NeuRO tackles long-horizon visual navigation under data scarcity and partial observability by fusing a Neural Perception Module with a Robust Optimization Planner. It leverages Partially Input Convex Neural Networks (PICNNs) for conformal calibration to produce convex uncertainty sets, and casts POMDP-like planning as robust optimization, with gradient flow enabled through KKT-based implicit differentiation and a Goal Vector Method to balance task and environment rewards. The solution feedback loop refines actions and rewards from the optimization output, allowing end-to-end training and transferability to different task formulations (U-MON and S-MON). Empirical results on unordered and sequential MultiON benchmarks show NeuRO achieving SoTA performance and improved generalization in unseen environments, with ablations and scalability analyses highlighting practical tradeoffs and robustness benefits.
Abstract
Visual navigation is a fundamental problem in embodied AI, yet practical deployments demand long-horizon planning capabilities to address multi-objective tasks. A major bottleneck is data scarcity: policies learned from limited data often overfit and fail to generalize OOD. Existing neural network-based agents typically increase architectural complexity that paradoxically become counterproductive in the small-sample regime. This paper introduce NeuRO, a integrated learning-to-optimize framework that tightly couples perception networks with downstream task-level robust optimization. Specifically, NeuRO addresses core difficulties in this integration: (i) it transforms noisy visual predictions under data scarcity into convex uncertainty sets using Partially Input Convex Neural Networks (PICNNs) with conformal calibration, which directly parameterize the optimization constraints; and (ii) it reformulates planning under partial observability as a robust optimization problem, enabling uncertainty-aware policies that transfer across environments. Extensive experiments on both unordered and sequential multi-object navigation tasks demonstrate that NeuRO establishes SoTA performance, particularly in generalization to unseen environments. Our work thus presents a significant advancement for developing robust, generalizable autonomous agents.
