High-Altitude Balloon Station-Keeping with First Order Model Predictive Control
Myles Pasetsky, Jiawei Lin, Bradley Guo, Sarah Dean
TL;DR
High-altitude balloon station-keeping is challenged by nonlinear, underactuated dynamics and partially observable winds. The authors develop First-Order Model Predictive Control (FOMPC) by implementing differentiable balloon and wind models in JAX to enable gradient-based online planning over a receding horizon, without offline training. In Balloon Learning Environment benchmarks, FOMPC outperforms the state-of-the-art RL controller Perciatelli44 by up to $24\%$ in time-within-radius (TWR) and reduces safety violations, though with higher per-step computation. The work provides a solid, training-free model-based baseline for HAB control and offers insights into which planning and wind-correction factors most influence performance, laying groundwork for future hybrids that integrate MPC and RL.
Abstract
High-altitude balloons (HABs) are common in scientific research due to their wide range of applications and low cost. Because of their nonlinear, underactuated dynamics and the partial observability of wind fields, prior work has largely relied on model-free reinforcement learning (RL) methods to design near-optimal control schemes for station-keeping. These methods often compare only against hand-crafted heuristics, dismissing model-based approaches as impractical given the system complexity and uncertain wind forecasts. We revisit this assumption about the efficacy of model-based control for station-keeping by developing First-Order Model Predictive Control (FOMPC). By implementing the wind and balloon dynamics as differentiable functions in JAX, we enable gradient-based trajectory optimization for online planning. FOMPC outperforms a state-of-the-art RL policy, achieving a 24% improvement in time-within-radius (TWR) without requiring offline training, though at the cost of greater online computation per control step. Through systematic ablations of modeling assumptions and control factors, we show that online planning is effective across many configurations, including under simplified wind and dynamics models.
