Optimal Control Operator Perspective and a Neural Adaptive Spectral Method
Mingquan Feng, Zhijie Chen, Yixin Huang, Yizhou Liu, Junchi Yan
TL;DR
This work reframes optimal control as learning an instance-to-solution operator $\mathcal{G}:I\to U$ that maps OCP instances to their optimal controls, enabling one-shot inference without explicit dynamics or iterative optimization. It introduces Neural Control Operator (NCO) realized by Neural Adaptive Spectral Method (NASM), an adaptive-basis neural operator that computes $\hat{\vb{u}}(t)$ from problem instances using time- and instance-dependent coefficients and an aggregation mechanism, with a formal approximation-error bound. Empirically, NASM achieves massive inference-time speedups (often $>10^3$–$10^4$×) and strong ID/OOD generalization across synthetic and real OCPs, including planary pushing and quadrotor control, outperforming several neural-operator baselines. The results indicate that NASM’s architectural design—adaptive basis, time-conditioned coefficients, and operator-learning—provides robust, reusable control solutions suitable for high-dimensional, data-rich control tasks, with clear avenues for enforcing constraints via physics-informed extensions.
Abstract
Optimal control problems (OCPs) involve finding a control function for a dynamical system such that a cost functional is optimized. It is central to physical systems in both academia and industry. In this paper, we propose a novel instance-solution control operator perspective, which solves OCPs in a one-shot manner without direct dependence on the explicit expression of dynamics or iterative optimization processes. The control operator is implemented by a new neural operator architecture named Neural Adaptive Spectral Method (NASM), a generalization of classical spectral methods. We theoretically validate the perspective and architecture by presenting the approximation error bounds of NASM for the control operator. Experiments on synthetic environments and a real-world dataset verify the effectiveness and efficiency of our approach, including substantial speedup in running time, and high-quality in- and out-of-distribution generalization.
