Recurrent Neural Operators: Stable Long-Term PDE Prediction
Zaijun Ye, Chen-Song Zhang, Wansheng Wang
TL;DR
This work tackles long-term forecasting for time-dependent PDEs by addressing the train-test mismatch inherent in teacher forcing. It introduces Recurrent Neural Operators (RNOs), which train the operator through autoregressive rollouts, aligning training with inference dynamics. The authors prove that recurrent training reduces worst-case error growth from exponential to linear in the forecast horizon and demonstrate empirically that RNOs, especially r-MgNO, achieve superior long-horizon accuracy and stability compared to teacher-forced baselines and post-hoc refiners. The approach offers a principled path toward robust data-driven dynamics for complex PDE systems, albeit with higher training costs and potential vanishing-gradient challenges that warrant future optimization.
Abstract
Neural operators have emerged as powerful tools for learning solution operators of partial differential equations. However, in time-dependent problems, standard training strategies such as teacher forcing introduce a mismatch between training and inference, leading to compounding errors in long-term autoregressive predictions. To address this issue, we propose Recurrent Neural Operators (RNOs)-a novel framework that integrates recurrent training into neural operator architectures. Instead of conditioning each training step on ground-truth inputs, RNOs recursively apply the operator to their own predictions over a temporal window, effectively simulating inference-time dynamics during training. This alignment mitigates exposure bias and enhances robustness to error accumulation. Theoretically, we show that recurrent training can reduce the worst-case exponential error growth typical of teacher forcing to linear growth. Empirically, we demonstrate that recurrently trained Multigrid Neural Operators significantly outperform their teacher-forced counterparts in long-term accuracy and stability on standard benchmarks. Our results underscore the importance of aligning training with inference dynamics for robust temporal generalization in neural operator learning.
