Neural NMPC through Signed Distance Field Encoding for Collision Avoidance
Martin Jacquet, Marvin Harms, Kostas Alexis
TL;DR
The paper presents SDF-NMPC, a system that encodes a single onboard range observation into a differentiable neural SDF and constrains a velocity-tracking NMPC to achieve mapless collision avoidance in unknown environments. It introduces a two-network architecture (a $eta$-VAE encoder and a coordinate-based MLP) to predict the SDF, trained in a sequential fashion, with a sampling-based scheme to generate targets. The NMPC treats the SDF as a differentiable constraint and enforces FoV limits, providing recursive feasibility and Lyapunov-like stability guarantees under fixed observations, while handling perception and odometry noise via slack variables. The approach is validated through extensive simulations and real-world forest experiments, demonstrating robust collision avoidance, resilience to odometry drift, and competitive performance against map-based and mapless baselines, with an open-source release for reproducibility and extension. The work advances mapless navigation by tightly integrating learned 3D environment representations with principled nonlinear control, enabling real-time, safety-critical operation in cluttered, unknown outdoor environments.
Abstract
This paper introduces a neural Nonlinear Model Predictive Control (NMPC) framework for mapless, collision-free navigation in unknown environments with Aerial Robots, using onboard range sensing. We leverage deep neural networks to encode a single range image, capturing all the available information about the environment, into a Signed Distance Function (SDF). The proposed neural architecture consists of two cascaded networks: a convolutional encoder that compresses the input image into a low-dimensional latent vector, and a Multi-Layer Perceptron that approximates the corresponding spatial SDF. This latter network parametrizes an explicit position constraint used for collision avoidance, which is embedded in a velocity-tracking NMPC that outputs thrust and attitude commands to the robot. First, a theoretical analysis of the contributed NMPC is conducted, verifying recursive feasibility and stability properties under fixed observations. Subsequently, we evaluate the open-loop performance of the learning-based components as well as the closed-loop performance of the controller in simulations and experiments. The simulation study includes an ablation study, comparisons with two state-of-the-art local navigation methods, and an assessment of the resilience to drifting odometry. The real-world experiments are conducted in forest environments, demonstrating that the neural NMPC effectively performs collision avoidance in cluttered settings against an adversarial reference velocity input and drifting position estimates.
