Table of Contents
Fetching ...

IntentReact: Guiding Reactive Object-Centric Navigation via Topological Intent

Yanmei Jiao, Anpeng Lu, Wenhan Hu, Rong Xiong, Yue Wang, Huajin Tang, Wen-an Zhang

Abstract

Object-goal visual navigation requires robots to reason over semantic structure and act effectively under partial observability. Recent approaches based on object-level topological maps enable long-horizon navigation without dense geometric reconstruction, but their execution remains limited by the gap between global topological guidance and local perception-driven control. In particular, local decisions are made solely from the current egocentric observation, without access to information beyond the robot's field of view. As a result, the robot may persist along its current heading even when initially oriented away from the goal, moving toward directions that do not decrease the global topological distance. In this work, we propose IntentReact, an intent-conditioned object-centric navigation framework that introduces a compact interface between global topological planning and reactive object-centric control. Our approach encodes global topological guidance as a low-dimensional directional signal, termed intent, which conditions a learned waypoint prediction policy to bias navigation toward topologically consistent progression. This design enables the robot to promptly reorient when local observations are misleading, guiding motion toward directions that decrease global topological distance while preserving the reactivity and robustness of object-centric control. We evaluate the proposed framework through extensive experiments, demonstrating improved navigation success and execution quality compared to prior object-centric navigation methods.

IntentReact: Guiding Reactive Object-Centric Navigation via Topological Intent

Abstract

Object-goal visual navigation requires robots to reason over semantic structure and act effectively under partial observability. Recent approaches based on object-level topological maps enable long-horizon navigation without dense geometric reconstruction, but their execution remains limited by the gap between global topological guidance and local perception-driven control. In particular, local decisions are made solely from the current egocentric observation, without access to information beyond the robot's field of view. As a result, the robot may persist along its current heading even when initially oriented away from the goal, moving toward directions that do not decrease the global topological distance. In this work, we propose IntentReact, an intent-conditioned object-centric navigation framework that introduces a compact interface between global topological planning and reactive object-centric control. Our approach encodes global topological guidance as a low-dimensional directional signal, termed intent, which conditions a learned waypoint prediction policy to bias navigation toward topologically consistent progression. This design enables the robot to promptly reorient when local observations are misleading, guiding motion toward directions that decrease global topological distance while preserving the reactivity and robustness of object-centric control. We evaluate the proposed framework through extensive experiments, demonstrating improved navigation success and execution quality compared to prior object-centric navigation methods.

Paper Structure

This paper contains 17 sections, 7 equations, 6 figures, 4 tables.

Figures (6)

  • Figure 1: Existing methods exhibit short-sighted behavior due to the decoupling of global planning and local control, leading to locally reasonable yet globally inefficient or even incorrect trajectories: (a) excessive subgoal attraction results in zigzag trajectories and low overall efficiency; (b) large initial heading errors cause persistent wrong-direction motion. Our method introduces a low-dimensional intent to condition the waypoint policy, enforcing topologically consistent progression and enabling efficient correction, as shown in (c).
  • Figure 2: IntentReact Navigation pipeline. (1) We construct an object-level topological map with 3D node information. (2) During online execution, the current observation is matched to the map to obtain a set of query nodes. (3) For each query node, global planning is performed to obtain a shortest path, from which an object-level costmap is built and an intent vector indicating the direction of decreasing topological distance is computed and fed into the local control stage. (4) The intent is incorporated as a conditional signal to modulate the controller’s feature representation, enabling global guidance rather than hard intervention. (5) A BEV costmap is further used to correct the feasibility of the predicted waypoint, forming a unified closed loop between learned control and geometric feasibility.
  • Figure 3: Comparison of different conditioning strategies. (a) Global path conditioning leverages full map or planned path information, but introduces redundancy and weakens local reactivity. (b) Goal-direction conditioning provides a simple directional cue, but may lead to premature turning and local collisions. (c) The proposed 2-hop intent conditioning encodes a compact topological signal, effectively balancing global consistency and local feasibility.
  • Figure 4: Qualitative results under the Shortcut setting. When the initial heading deviates from the goal direction, RoboHop, TANGO, and ObjectReact often follow misleading local cues, resulting in inefficient or failed trajectories. In contrast, our method leverages the intent signal to correct early deviations and maintain topologically consistent progression. In cases without ambiguity, it behaves identically to ObjectReact, indicating that intent acts as a soft bias without disrupting the underlying reactive policy.
  • Figure 5: Step-by-step visualization of our method for the first case in Fig. \ref{['fig:case']}. At each step, four panels show the RGB observation, segmentation, BEV waypoint, and intent direction. Despite a large initial heading error, intent provides consistent directional bias, enabling rapid correction toward the goal (step 24). BEV refinement further ensures safe execution by projecting raw waypoints onto feasible regions (step 75).
  • ...and 1 more figures