Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera

Yu Hu; Yuang Zhang; Yunlong Song; Yang Deng; Feng Yu; Linzuo Zhang; Weiyao Lin; Danping Zou; Wenxian Yu

TL;DR

This paper tackles obstacle avoidance for quadrotors using monocular optical flow, addressing the limitations of depth-based sensing and flow ambiguity. It introduces an end-to-end framework that maps optical flow to control via a differentiable simulator, augmented by central flow attention and action-guided active sensing, enabling agile flight up to $6$ m/s. A GPU-based differentiable simulator supports Backpropagation Through Time, allowing end-to-end policy optimization and zero-shot sim-to-real transfer, demonstrated on real FPV hardware. Results show robust high-speed navigation in unknown cluttered environments, while identifying remaining gaps due to optical-flow noise and rotational effects near the FoE that limit performance relative to depth-based approaches.

Abstract

Optical flow captures the motion of pixels in an image sequence over time, providing information about movement, depth, and environmental structure. Flying insects utilize this information to navigate and avoid obstacles, allowing them to execute highly agile maneuvers even in complex environments. Despite its potential, autonomous flying robots have yet to fully leverage this motion information to achieve comparable levels of agility and robustness. Challenges of control from optical flow include extracting accurate optical flow at high speeds, handling noisy estimation, and ensuring robust performance in complex environments. To address these challenges, we propose a novel end-to-end system for quadrotor obstacle avoidance using monocular optical flow. We develop an efficient differentiable simulator coupled with a simplified quadrotor model, allowing our policy to be trained directly through first-order gradient optimization. Additionally, we introduce a central flow attention mechanism and an action-guided active sensing strategy that enhance the policy's focus on task-relevant optical flow observations to enable more responsive decision-making during flight. Our system is validated both in simulation and the real world using an FPV racing drone. Despite being trained in a simple environment in simulation, our system demonstrates agile and robust flight in various unknown, cluttered environments in the real world at speeds of up to 6 m/s.
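
To illustrate why optical flow carries depth information for a single forward-moving camera, the sketch below uses the standard pinhole relation for pure forward translation over a static scene: a point's flow points away from the focus of expansion (FoE), with magnitude equal to its offset from the FoE divided by the time-to-contact. This is a textbook idealization, not the paper's policy or simulator; the function names, the normalized image coordinates, and the no-rotation assumption are all illustrative choices made here.

```python
import math


def radial_flow(x, y, depth, forward_speed, foe=(0.0, 0.0)):
    """Flow (per second, normalized image coords) of a static point.

    Assumes pure forward translation with no rotation (illustrative only).
    """
    rate = forward_speed / depth  # inverse time-to-contact, in 1/s
    return ((x - foe[0]) * rate, (y - foe[1]) * rate)


def time_to_contact(x, y, u, v, foe=(0.0, 0.0)):
    """Recover time-to-contact (s) from a pixel's FoE offset and its flow."""
    r = math.hypot(x - foe[0], y - foe[1])
    speed = math.hypot(u, v)
    if r == 0.0 or speed == 0.0:
        return math.inf  # flow vanishes at the FoE, so there is no depth cue
    return r / speed


# Hypothetical numbers: an obstacle 12 m ahead, approached at 6 m/s.
u, v = radial_flow(0.2, 0.1, depth=12.0, forward_speed=6.0)
ttc = time_to_contact(0.2, 0.1, u, v)
```

Under these assumptions the recovered time-to-contact equals depth divided by forward speed. The flow magnitude shrinks to zero at the FoE, so obstacles directly along the flight direction yield the weakest signal, and any rotational flow component (ignored here) becomes comparatively large in that region.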
