A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management
Surya Murthy, John-Paul Clarke, Ufuk Topcu, Zhenyu Gao
TL;DR
The paper tackles quiet and safe urban air mobility traffic management by formulating a multi-agent MDp with altitude-change actions and a reward that combines $R_{noise}$ and $R_{separation}$. It utilizes the attention-enhanced D2MAV-A policy, trained with PPO in a centralized-to-decentralized framework, within BlueSky simulations and NASA RVLT-based noise data. The contributions include a joint noise-safety objective, a scalable RL framework for real-time control, and empirical insights from a South Austin network showing the tradeoffs between altitude distribution, congestion, LOS events, and ground noise. The findings demonstrate that altitude-based adjustments can balance environmental and safety goals, informing policy design for practical, city-scale UAM deployment.
Abstract
Urban air mobility (UAM) is a transformative system that operates various small aerial vehicles in urban environments to reshape urban transportation. However, integrating UAM into existing urban environments presents a variety of complex challenges. Recent analyses of UAM's operational constraints highlight aircraft noise and system safety as key hurdles to UAM system implementation. Future UAM air traffic management schemes must ensure that the system is both quiet and safe. We propose a multi-agent reinforcement learning approach to manage UAM traffic, aiming at both vertical separation assurance and noise mitigation. Through extensive training, the reinforcement learning agent learns to balance the two primary objectives by employing altitude adjustments in a multi-layer UAM network. The results reveal the tradeoffs among noise impact, traffic congestion, and separation. Overall, our findings demonstrate the potential of reinforcement learning in mitigating UAM's noise impact while maintaining safe separation using altitude adjustments
