Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks
Yequan Zhao, Xinling Yu, Xian Xiao, Zhixiong Chen, Ziyue Liu, Geza Kurczveil, Raymond G. Beausoleil, Sijia Liu, Zheng Zhang
TL;DR
The paper addresses the challenge of training physics-informed neural networks (PINNs) on edge photonic hardware by introducing a completely back-propagation-free framework. It combines a sparse-grid Stein derivative estimator for loss evaluation with tensor-train (TT) based zeroth-order optimization to update parameters, enabling scalable training for PINNs with hundreds of neurons per layer. A scalable on-chip photonic accelerator (TONN) with space- and time-multiplexed designs is proposed to support real-size PINNs, dramatically reducing device counts and enabling real-time training in simulations. Across multiple PDE benchmarks and hardware simulations, the approach achieves competitive accuracy and substantial hardware efficiency, highlighting the practicality and potential of BP-free training for edge photonic computing and beyond.
Abstract
Physics-informed neural networks (PINNs) have shown promise in solving partial differential equations (PDEs), with growing interest in their energy-efficient, real-time training on edge devices. Photonic computing offers a potential solution to achieve this goal because of its ultra-high operation speed. However, the lack of photonic memory and the large device sizes prevent training real-size PINNs on photonic chips. This paper proposes a completely back-propagation-free (BP-free) and highly salable framework for training real-size PINNs on silicon photonic platforms. Our approach involves three key innovations: (1) a sparse-grid Stein derivative estimator to avoid the BP in the loss evaluation of a PINN, (2) a dimension-reduced zeroth-order optimization via tensor-train decomposition to achieve better scalability and convergence in BP-free training, and (3) a scalable on-chip photonic PINN training accelerator design using photonic tensor cores. We validate our numerical methods on both low- and high-dimensional PDE benchmarks. Through circuit simulation based on real device parameters, we further demonstrate the significant performance benefit (e.g., real-time training, huge chip area reduction) of our photonic accelerator.
