An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding
Pengwei Liu, Xingyu Ren, Pengkai Wang, Hangjie Yuan, Zhongkai Hao, Guanyu Chen, Chao Xu, Dong Ni, Shengze Cai
TL;DR
PhysGTO introduces an efficient Graph-Transformer Operator that embeds manifold structure in both physical and latent spaces to learn physical dynamics on unstructured meshes. It combines a topology-aware, flux-oriented message-passing mechanism with a projection-inspired, linear-complexity attention to capture local and global dependencies, achieving linear complexity in mesh size. A topology-aware mesh sampler enables scalable training on large meshes while maintaining accuracy, and a learnable subspace attention basis allows robust long-range reasoning. Across eleven datasets spanning static and transient 3D geometries, PhysGTO achieves state-of-the-art accuracy with substantial reductions in parameters and FLOPs, demonstrating both high fidelity and practical efficiency for real-world physics surrogates.
Abstract
Accurate and efficient physical simulations are essential in science and engineering, yet traditional numerical solvers face significant challenges in computational cost when handling simulations across dynamic scenarios involving complex geometries, varying boundary/initial conditions, and diverse physical parameters. While deep learning offers promising alternatives, existing methods often struggle with flexibility and generalization, particularly on unstructured meshes, which significantly limits their practical applicability. To address these challenges, we propose PhysGTO, an efficient Graph-Transformer Operator for learning physical dynamics through explicit manifold embeddings in both physical and latent spaces. In the physical space, the proposed Unified Graph Embedding module aligns node-level conditions and constructs sparse yet structure-preserving graph connectivity to process heterogeneous inputs. In the latent space, PhysGTO integrates a lightweight flux-oriented message-passing scheme with projection-inspired attention to capture local and global dependencies, facilitating multilevel interactions among complex physical correlations. This design ensures linear complexity relative to the number of mesh points, reducing both the number of trainable parameters and computational costs in terms of floating-point operations (FLOPs), and thereby allowing efficient inference in real-time applications. We introduce a comprehensive benchmark spanning eleven datasets, covering problems with unstructured meshes, transient flow dynamics, and large-scale 3D geometries. PhysGTO consistently achieves state-of-the-art accuracy while significantly reducing computational costs, demonstrating superior flexibility, scalability, and generalization in a wide range of simulation tasks.
