Task-Parameter Nexus: Task-Specific Parameter Learning for Model-Based Control
Sheng Cheng, Ran Tao, Yuliang Gu, Shenlong Wang, Xiaofeng Wang, Naira Hovakimyan
TL;DR
The paper introduces Task-Parameter Nexus (TPN), a learning-based framework that online-predicts near-optimal control parameters for model-based controllers to track arbitrary trajectories. It builds a trajectory bank spanning independent speed and curvature axes, uses batch-DiffTune to offline auto-tune expert parameters for each task, and trains a DNN to map trajectory tasks to parameter vectors, with constraints enforced by RAYEN to maintain stability. In quadrotor simulations, TPN achieves near-expert performance within the training bank and generalizes to unseen tasks inside the bank, with robust behavior on out-of-bank and trigonometrically parameterized tasks compared to untrained parameters. The findings demonstrate that combining a rich task bank, batch differentiable tuning, and supervised learning enables rapid, robust runtime adaptation of control parameters across diverse tasks, with potential to extend to other dynamic systems and to real-world flight tests.
Abstract
This paper presents the Task-Parameter Nexus (TPN), a learning-based approach for online determination of the (near-)optimal control parameters of model-based controllers (MBCs) for tracking tasks. In TPN, a deep neural network is introduced to predict the control parameters for any given tracking task at runtime, especially when optimal parameters for new tasks are not immediately available. To train this network, we constructed a trajectory bank with various speeds and curvatures that represent different motion characteristics. Then, for each trajectory in the bank, we auto-tune the optimal control parameters offline and use them as the corresponding ground truth. With this dataset, the TPN is trained by supervised learning. We evaluated the TPN on the quadrotor platform. In simulation experiments, it is shown that the TPN can predict near-optimal control parameters for a spectrum of tracking tasks, demonstrating its robust generalization capabilities to unseen tasks.
