UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

Yuci Han; Charles Toth; Alper Yilmaz

UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

Yuci Han, Charles Toth, Alper Yilmaz

TL;DR

This work addresses long-range, monocular-vision UAS navigation in large urban environments and the challenge of transferring learned policies to unseen areas. It introduces a two-stage meta-curriculum framework that first meta-trains a master policy over multiple tasks and then fine-tunes it through a hierarchical coarse-to-fine curriculum, complemented by Incremental Self-Adaptive Reinforcement Learning (ISAR) to speed up learning for long-horizon tasks. ISAR combines inner-episode interaction loss with an adaptive loss to perform incremental policy updates across short trajectory windows, compatible with base RL algorithms such as A3C or PPO. Empirical results in the AirSim simulator show faster convergence and robust transfer to unseen environments, indicating significant reductions in training cost and improved adaptability for real-world urban navigation tasks.

Abstract

The aim of this work is to develop an approach that enables Unmanned Aerial System (UAS) to efficiently learn to navigate in large-scale urban environments and transfer their acquired expertise to novel environments. To achieve this, we propose a meta-curriculum training scheme. First, meta-training allows the agent to learn a master policy to generalize across tasks. The resulting model is then fine-tuned on the downstream tasks. We organize the training curriculum in a hierarchical manner such that the agent is guided from coarse to fine towards the target task. In addition, we introduce Incremental Self-Adaptive Reinforcement learning (ISAR), an algorithm that combines the ideas of incremental learning and meta-reinforcement learning (MRL). In contrast to traditional reinforcement learning (RL), which focuses on acquiring a policy for a specific task, MRL aims to learn a policy with fast transfer ability to novel tasks. However, the MRL training process is time consuming, whereas our proposed ISAR algorithm achieves faster convergence than the conventional MRL algorithm. We evaluate the proposed methodologies in simulated environments and demonstrate that using this training philosophy in conjunction with the ISAR algorithm significantly improves the convergence speed for navigation in large-scale cities and the adaptation proficiency in novel environments.

UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

TL;DR

Abstract

UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)