From Cheap Geometry to Expensive Physics: Elevating Neural Operators via Latent Shape Pretraining
Zhizhou Zhang, Youjia Wu, Kaixuan Zhang, Yanjia Wang
TL;DR
The paper tackles the data-scarcity challenge in neural operators for PDEs by introducing a two-stage training framework that exploits abundant geometry-only data. In Stage 1, a physics-agnostic pretraining step trains a point-cloud VAE to learn latent geometry representations via an occupancy-field proxy; in Stage 2, neural operators are trained to map these learned latents to PDE solutions, freezing the encoder. Across four PDE datasets and three transformer-based operators, the approach yields consistent accuracy gains, especially for query batches drawn from random locations, demonstrating that geometry-informed latent representations enhance data efficiency. The method offers a flexible, scalable path to improve surrogate PDE solvers in industrial settings where labeled physics data are scarce but geometry samples are plentiful.
Abstract
Industrial design evaluation often relies on high-fidelity simulations of governing partial differential equations (PDEs). While accurate, these simulations are computationally expensive, making dense exploration of design spaces impractical. Operator learning has emerged as a promising approach to accelerate PDE solution prediction; however, its effectiveness is often limited by the scarcity of labeled physics-based data. At the same time, large numbers of geometry-only candidate designs are readily available but remain largely untapped. We propose a two-stage framework to better exploit this abundant, physics-agnostic resource and improve supervised operator learning under limited labeled data. In Stage 1, we pretrain an autoencoder on a geometry reconstruction task to learn an expressive latent representation without PDE labels. In Stage 2, the neural operator is trained in a standard supervised manner to predict PDE solutions, using the pretrained latent embeddings as inputs instead of raw point clouds. Transformer-based architectures are adopted for both the autoencoder and the neural operator to handle point cloud data and integrate both stages seamlessly. Across four PDE datasets and three state-of-the-art transformer-based neural operators, our approach consistently improves prediction accuracy compared to models trained directly on raw point cloud inputs. These results demonstrate that representations from physics-agnostic pretraining provide a powerful foundation for data-efficient operator learning.
