Noise-Guided Transport for Imitation Learning

Lionel Blondé; Joao A. Candido Ramos; Alexandros Kalousis

Noise-Guided Transport for Imitation Learning

Lionel Blondé, Joao A. Candido Ramos, Alexandros Kalousis

TL;DR

Noise-Guided Transport (NGT) tackles imitation learning under ultra-low data by learning a reward signal through an OT-grounded adversarial objective. It uses a predictor f_ξ and a frozen random-prior f†_ξ to form a 1-Lipschitz potential h_ξ and defines the reward r_ξ=exp(-h_ξ), with the objective L(ξ)=E_{expert}[h_ξ]−E_{agent}[h_ξ] connected to the Earth Mover's Distance between expert and agent distributions. The method enforces Lipschitz continuity with spectral normalization and orthogonal initialization, and leverages distributional HL-Gaussian losses to stabilize training in high-dimensional tasks, achieving expert performance with as few as 20 transitions, even in state-only settings. Theoretical results include a concentration bound for the empirical loss and a Lipschitz analysis of HL-Gaussian, guiding hyperparameters such as the smoothing scale σ and the bin count N. Overall, NGT presents a lightweight, off-policy, pretraining-free IL approach with strong data-efficiency, competitive runtime, and applicability to healthcare and biorobotics where demonstrations are scarce.

Abstract

We consider imitation learning in the low-data regime, where only a limited number of expert demonstrations are available. In this setting, methods that rely on large-scale pretraining or high-capacity architectures can be difficult to apply, and efficiency with respect to demonstration data becomes critical. We introduce Noise-Guided Transport (NGT), a lightweight off-policy method that casts imitation as an optimal transport problem solved via adversarial training. NGT requires no pretraining or specialized architectures, incorporates uncertainty estimation by design, and is easy to implement and tune. Despite its simplicity, NGT achieves strong performance on challenging continuous control tasks, including high-dimensional Humanoid tasks, under ultra-low data regimes with as few as 20 transitions. Code is publicly available at: https://github.com/lionelblonde/ngt-pytorch.

Noise-Guided Transport for Imitation Learning

TL;DR

Abstract

Noise-Guided Transport for Imitation Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (11)