Auto-Multilift: Distributed Learning and Control for Cooperative Load Transportation With Quadrotors
Bingheng Wang, Rui Huang, Lin Zhao
TL;DR
Auto-Multilift is a distributed, closed-loop framework that automatically tunes MPC hyperparameters for cooperative cable-suspended load transport with quadrotors. It models the MPC cost functions with deep neural networks and trains them with a distributed policy gradient algorithm on closed-loop trajectories. The training relies on distributed sensitivity propagation (DSP), which exploits the dynamic couplings between agents to parallelize gradient computation across quadrotors. In simulation, the method scales to a large number of quadrotors and outperforms a state-of-the-art open-loop MPC tuning approach, including in obstacle-rich scenarios that require adaptive tension references. Combining DSP with Safe-PDP-based gradient computation enables end-to-end learning directly from system tracking errors, with potential for real-time adaptation and for application to more complex multi-robot transport tasks.
Abstract
Designing motion control and planning algorithms for multilift systems remains challenging due to the complexities of dynamics, collision avoidance, actuator limits, and scalability. Existing methods that use optimization and distributed techniques effectively address these constraints and scalability issues. However, they often require substantial manual tuning, leading to suboptimal performance. This paper proposes Auto-Multilift, a novel framework that automates the tuning of model predictive controllers (MPCs) for multilift systems. We model the MPC cost functions with deep neural networks (DNNs), enabling fast online adaptation to various scenarios. We develop a distributed policy gradient algorithm to train these DNNs efficiently in a closed-loop manner. Central to our algorithm is distributed sensitivity propagation, which fully exploits the unique dynamic couplings within the multilift system. It parallelizes gradient computation across quadrotors and focuses on the sensitivities of the actual system states with respect to key MPC parameters. Extensive simulations demonstrate favorable scalability to a large number of quadrotors. Our method outperforms a state-of-the-art open-loop MPC tuning approach by effectively learning adaptive MPCs from trajectory tracking errors. It also excels in learning an adaptive reference for reconfiguring the system when traversing multiple narrow slots.
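The idea of training from closed-loop tracking errors by propagating state sensitivities can be illustrated with a deliberately small sketch. It is not the paper's algorithm: it assumes a single agent with scalar linear dynamics, and a proportional feedback law with one tunable gain `theta` stands in for the MPC and its DNN-parameterized cost. All names (`rollout_with_sensitivity`, `tune`, `a`, `b`, `ref`) are hypothetical. The sketch only shows how the sensitivity of the actual state to a controller parameter is carried forward along the closed-loop trajectory and used for gradient descent on the tracking loss.

```python
def rollout_with_sensitivity(theta, a=1.0, b=0.1, x0=1.0, ref=0.0, steps=50):
    """Simulate the closed loop and return (tracking loss, d loss / d theta).

    Assumed toy model: x_{k+1} = a*x_k + b*u_k with u_k = -theta*(x_k - ref).
    s_k = d x_k / d theta is propagated forward alongside the state.
    """
    x, s = x0, 0.0            # the initial state does not depend on theta
    loss, grad = 0.0, 0.0
    for _ in range(steps):
        err = x - ref
        loss += err ** 2
        grad += 2.0 * err * s              # chain rule through the state x_k
        u = -theta * err
        du_dtheta = -err - theta * s       # direct effect plus effect via x_k
        x = a * x + b * u                  # closed-loop state update
        s = a * s + b * du_dtheta          # sensitivity update
    return loss, grad


def tune(theta=0.5, lr=0.05, iters=200):
    """Plain gradient descent on the closed-loop tracking loss."""
    for _ in range(iters):
        _, grad = rollout_with_sensitivity(theta)
        theta -= lr * grad
    return theta
```

Calling `tune()` returns a gain whose closed-loop loss is lower than that of the initial gain. In the setting described above, the scalar gain would be replaced by DNN weights, the feedback law by an MPC solve, and the sensitivity recursion would be distributed across quadrotors.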
