Table of Contents
Fetching ...

Quattro: Transformer-Accelerated Iterative Linear Quadratic Regulator Framework for Fast Trajectory Optimization

Yue Wang, Haoyu Wang, Zhaoxing Li

TL;DR

Quattro introduces Transformer-accelerated $iLQR$ to mitigate the sequential bottleneck in real-time trajectory optimization. By predicting intermediate feedback and feedforward gains with a decoder-only Transformer and deploying a specialized FPGA accelerator, it enables parallel computation and substantial reductions in latency within an MPC framework. The approach achieves up to 27x per-iteration speedups (and up to 17.8x end-to-end MPC acceleration) on cart-pole and quadrotor tasks, with notable power savings on edge hardware. The work demonstrates the viability of Transformer-based acceleration for real-time optimal control and presents a lightweight configuration that balances accuracy, speed, and hardware practicality, with clear avenues for broader robotic applications and learning-based robustness.

Abstract

Real-time optimal control remains a fundamental challenge in robotics, especially for nonlinear systems with stringent performance requirements. As one of the representative trajectory optimization algorithms, the iterative Linear Quadratic Regulator (iLQR) faces limitations due to their inherently sequential computational nature, which restricts the efficiency and applicability of real-time control for robotic systems. While existing parallel implementations aim to overcome the above limitations, they typically demand additional computational iterations and high-performance hardware, leading to only modest practical improvements. In this paper, we introduce Quattro, a transformer-accelerated iLQR framework employing an algorithm-hardware co-design strategy to predict intermediate feedback and feedforward matrices. It facilitates effective parallel computations on resource-constrained devices without sacrificing accuracy. Experiments on cart-pole and quadrotor systems show an algorithm-level acceleration of up to 5.3$\times$ and 27$\times$ per iteration, respectively. When integrated into a Model Predictive Control (MPC) framework, Quattro achieves overall speedups of 2.8$\times$ for the cart-pole and 17.8$\times$ for the quadrotor compared to the one that applies traditional iLQR. Transformer inference is deployed on FPGA to maximize performance, achieving further up to 20.8$\times$ speedup over prevalent embedded CPUs with over 11$\times$ power reduction than GPU and low hardware resource overhead.

Quattro: Transformer-Accelerated Iterative Linear Quadratic Regulator Framework for Fast Trajectory Optimization

TL;DR

Quattro introduces Transformer-accelerated to mitigate the sequential bottleneck in real-time trajectory optimization. By predicting intermediate feedback and feedforward gains with a decoder-only Transformer and deploying a specialized FPGA accelerator, it enables parallel computation and substantial reductions in latency within an MPC framework. The approach achieves up to 27x per-iteration speedups (and up to 17.8x end-to-end MPC acceleration) on cart-pole and quadrotor tasks, with notable power savings on edge hardware. The work demonstrates the viability of Transformer-based acceleration for real-time optimal control and presents a lightweight configuration that balances accuracy, speed, and hardware practicality, with clear avenues for broader robotic applications and learning-based robustness.

Abstract

Real-time optimal control remains a fundamental challenge in robotics, especially for nonlinear systems with stringent performance requirements. As one of the representative trajectory optimization algorithms, the iterative Linear Quadratic Regulator (iLQR) faces limitations due to their inherently sequential computational nature, which restricts the efficiency and applicability of real-time control for robotic systems. While existing parallel implementations aim to overcome the above limitations, they typically demand additional computational iterations and high-performance hardware, leading to only modest practical improvements. In this paper, we introduce Quattro, a transformer-accelerated iLQR framework employing an algorithm-hardware co-design strategy to predict intermediate feedback and feedforward matrices. It facilitates effective parallel computations on resource-constrained devices without sacrificing accuracy. Experiments on cart-pole and quadrotor systems show an algorithm-level acceleration of up to 5.3 and 27 per iteration, respectively. When integrated into a Model Predictive Control (MPC) framework, Quattro achieves overall speedups of 2.8 for the cart-pole and 17.8 for the quadrotor compared to the one that applies traditional iLQR. Transformer inference is deployed on FPGA to maximize performance, achieving further up to 20.8 speedup over prevalent embedded CPUs with over 11 power reduction than GPU and low hardware resource overhead.

Paper Structure

This paper contains 19 sections, 19 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: Comparison of standard (Vanilla) iLQR and transformer-accelerated iLQR Framework. The red-shaded area represents the standard iLQR computation process. The blue-shaded area illustrates the proposed Quattro.
  • Figure 2: Detailed framework architecture of the iLQR-TF.
  • Figure 3: Comparison between ground truth (solid lines) and transformer-predicted (dashed lines) gain sequences for the cart-pole system, using a prompt length of 5 prior gain steps.
  • Figure 4: Running time and prediction accuracy for different computational distributions between iLQR and Transformer for the cart-pole (upper) and the quadrotor (lower) systems. Bar plots show computation time per iteration (left Y axis), and scatter plots indicate MSE distributions (right Y axis) for varying prompt lengths. Numbers in parentheses (e.g., iLQR (30) TF (20)) denote the computational steps performed by iLQR and the Transformer, respectively.
  • Figure 5: Cart-pole system: (upper) computational efficiency and (lower) system state trajectories comparison among iLQR, iLQR-TF, and blended (mixture of iLQR-TF and LQR). The horizon is 30 and the Transformer prediction length is 25.
  • ...and 2 more figures