FLYINGTRUST: A Benchmark for Quadrotor Navigation Across Scenarios and Vehicles

Gang Li; Chunlei Zhai; Teng Wang; Shaun Li; Shangsong Jiang; Xiangwei Zhu

TL;DR

FLYINGTRUST proposes a simulation-first benchmark to quantify how quadrotor navigation robustness depends on both platform kinodynamics and scenario geometry. It defines two interpretable kinodynamic indicators, the maximum thrust-to-weight ratio $\mathrm{TWR}_{\max}$ and the axis-wise maximum angular accelerations $\alpha_{xy,\max}$ and $\alpha_{z,\max}$, and derives a compact performance metric $P = \mathrm{TWR}_{\max} \cdot \alpha_{xy,\max} \cdot \alpha_{z,\max}$ to characterize platform capability. The framework combines 18 real and 18 virtual platform profiles with seven navigation scenarios, giving $36 \times 7 = 252$ platform-scenario combinations, each evaluated over multiple trials. Results are summarized with a composite, uncertainty-aware score that weights scenario and platform importance and penalizes instability. A diverse set of optimization-based and learning-based navigation methods is benchmarked, revealing systematic interactions between kinodynamics and scene geometry and highlighting distinct failure modes across algorithms. The results underscore the need to design and evaluate navigation methods that remain robust across heterogeneous platforms and scenarios, guiding safer and more cost-effective real-world deployment.
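
As a rough illustration of the performance metric and the size of the evaluation grid described above, the following minimal sketch computes $P$ for a platform profile and enumerates the platform-scenario combinations. The `Platform` class, the field names, the placeholder platform and scenario names, the numeric values, the units, and the reading of $xy$ as the roll/pitch axes are all assumptions made for illustration; they are not taken from the benchmark's code or data.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Platform:
    """Hypothetical platform profile; names and units are assumptions."""
    name: str
    twr_max: float       # maximum thrust-to-weight ratio (dimensionless)
    alpha_xy_max: float  # max angular acceleration, assumed roll/pitch axes
    alpha_z_max: float   # max angular acceleration, assumed yaw axis

    def performance(self) -> float:
        # P = TWR_max * alpha_xy_max * alpha_z_max, as stated in the summary.
        return self.twr_max * self.alpha_xy_max * self.alpha_z_max


# Placeholder profiles: 18 "real" and 18 "virtual" platforms with made-up values.
platforms = [
    Platform(f"{kind}_{i}", 2.0 + 0.1 * i, 100.0, 20.0)
    for kind in ("real", "virtual")
    for i in range(18)
]

# Seven placeholder scenario names.
scenarios = [f"scenario_{k}" for k in range(7)]

# Every platform is paired with every scenario.
combinations = list(product(platforms, scenarios))

example = Platform("example", 2.0, 100.0, 20.0)
print(example.performance())
print(len(platforms), len(scenarios), len(combinations))
```

Because $P$ is a plain product, a platform with a high thrust-to-weight ratio but weak yaw authority can end up with the same $P$ as a more balanced one, so the individual indicators remain worth inspecting alongside the aggregate.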

Abstract

Visual navigation algorithms for quadrotors often vary widely in performance when transferred across different vehicle platforms and scene geometries, which increases the cost and risk of field deployment. To support systematic early-stage evaluation, we introduce FLYINGTRUST, a high-fidelity, configurable benchmarking framework that measures how platform kinodynamics and scenario structure jointly affect navigation robustness. FLYINGTRUST models vehicle capability with two compact, physically interpretable indicators: maximum thrust-to-weight ratio and axis-wise maximum angular acceleration. The benchmark pairs a diverse scenario library with a heterogeneous set of real and virtual platforms and prescribes a standardized evaluation protocol together with a composite scoring method that balances scenario importance, platform importance, and performance stability. We use FLYINGTRUST to compare representative optimization-based and learning-based navigation approaches under identical conditions, performing repeated trials per platform-scenario combination and reporting uncertainty-aware metrics. The results reveal systematic patterns: navigation success depends predictably on platform capability and scene geometry, and different algorithms exhibit distinct preferences and failure modes across the evaluated conditions. These observations highlight the practical necessity of incorporating both platform capability and scenario structure into algorithm design, evaluation, and selection, and they motivate future work on methods that remain robust across diverse platforms and scenarios.