ELM-DeepONets: Backpropagation-Free Training of Deep Operator Networks via Extreme Learning Machines
Hwijae Son
TL;DR
ELM-DeepONet introduces a backpropagation-free training scheme for DeepONets by embedding Extreme Learning Machine ideas into the trunk-branch operator framework. By fixing the trunk and using a learnable matrix $\mathbf{W}$ to fuse branch outputs into trunk coefficients, the method reduces training to a fast least-squares problem with the solution $\widehat{\mathbf{W}} = \mathbf{T}^\dagger \mathbf{G} \mathbf{B}^\dagger$. Extensive experiments on nonlinear ODEs, Darcy flow, and a reaction-diffusion inverse source problem show that ELM-DeepONet achieves competitive or superior accuracy while drastically reducing training time compared to vanilla DeepONet. This work offers a scalable, efficient alternative for operator learning in scientific computing, with practical guidance on hyperparameters and potential extensions to physics-informed variants.
Abstract
Deep Operator Networks (DeepONets) are among the most prominent frameworks for operator learning, grounded in the universal approximation theorem for operators. However, training DeepONets typically requires significant computational resources. To address this limitation, we propose ELM-DeepONets, an Extreme Learning Machine (ELM) framework for DeepONets that leverages the backpropagation-free nature of ELM. By reformulating DeepONet training as a least-squares problem for newly introduced parameters, the ELM-DeepONet approach significantly reduces training complexity. Validation on benchmark problems, including nonlinear ODEs and PDEs, demonstrates that the proposed method not only achieves superior accuracy but also drastically reduces computational costs. This work offers a scalable and efficient alternative for operator learning in scientific computing.
