Learning Hierarchical Control Systems for Autonomous Systems with Energy Constraints
Charlott Vallon, Mark Pustilnik, Alessandro Pinto, Francesco Borrelli
TL;DR
This work tackles the problem of operating autonomous systems under energy constraints by proposing a learning-enabled hierarchical control architecture that couples high-level energy-aware task planning with a low-level data-driven MPC. The method formalizes a two-tier SA/DM framework where upper-level estimates of route time and energy $(\hat{T}_{i,j}, \hat{E}_{i,j})$ are refined through learning, while the lower level converts plans into executable trajectories and adapts via iteration-based energy-model updates. Demonstrations on a two-vehicle electric delivery scenario show that iterative learning reduces conservatism and increases tasks completed within daily limits, though learning-rate choices critically affect stability and feasibility. Overall, the paper contributes a modular framework for energy-aware autonomous operation and highlights theoretical avenues for guaranteeing safety, feasibility, and real-time applicability of learning in hierarchical control.
Abstract
This paper focuses on the design of hierarchical control architectures for autonomous systems with energy constraints. We focus on systems where energy storage limitations and slow recharge rates drastically affect the way the autonomous systems are operated. Using examples from space robotics and public transportation, we motivate the need for formally designed learning hierarchical control systems. We propose a learning control architecture which incorporates learning mechanisms at various levels of the control hierarchy to improve performance and resource utilization. The proposed hierarchical control scheme relies on high-level energy-aware task planning and assignment, complemented by a low-level predictive control mechanism responsible for the autonomous execution of tasks, including motion control and energy management. Simulation examples show the benefits and the limitations of the proposed architecture when learning is used to obtain a more energy-efficient task allocation.
