Automated machine learning for physics-informed convolutional neural networks
Wanyun Zhou, Haoze Song, Xiaowen Chu
TL;DR
This work tackles the challenge of manually designing physics-informed CNNs (PICNNs) for parametric PDEs by introducing Auto-PICNN, an AutoML framework that automatically searches both loss functions and network architectures. It couples an operator-infused loss-function space with an entire-structured CNN space and employs a two-stage search (Bayesian optimization for losses, policy-based RL for architectures) to optimize PICNNs, including a ConvLSTM module to handle spatiotemporal dynamics. Empirically, Auto-PICNN substantially outperforms manually designed PICNN baselines and neural operators across six PDE systems, achieving up to a $59.78$-fold reduction in error on some tasks and an average $13.31$-fold improvement. The framework demonstrates notable efficiency in search time (roughly two days at worst) and shows robustness across diverse PDEs, offering a practical path to deploying physics-informed models without expert neural architecture search expertise.
Abstract
Recent advances in deep learning for solving partial differential equations (PDEs) have introduced physics-informed neural networks (PINNs), which integrate machine learning with physical laws. Physics-informed convolutional neural networks (PICNNs) extend PINNs by leveraging CNNs for enhanced generalization and efficiency. However, current PICNNs depend on manual design, and inappropriate designs may not effectively solve PDEs. Furthermore, due to the diversity of physical problems, the ideal network architectures and loss functions vary across different PDEs. It is impractical to find the optimal PICNN architecture and loss function for each specific physical problem through extensive manual experimentation. To surmount these challenges, this paper uses automated machine learning (AutoML) to automatically and efficiently search for the loss functions and network architectures of PICNNs. We introduce novel search spaces for loss functions and network architectures and propose a two-stage search strategy. The first stage focuses on searching for factors and residual adjustment operations that influence the loss function, while the second stage aims to find the best CNN architecture. Experimental results show that our automatic searching method significantly outperforms the manually-designed model on multiple datasets.
