Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Michael McCabe, Payel Mukhopadhyay, Tanya Marwah, Bruno Regaldo-Saint Blancard, Francois Rozet, Cristiana Diaconu, Lucas Meyer, Kaze W. K. Wong, Hadi Sotoudeh, Alberto Bietti, Irina Espejo, Rio Fear, Siavash Golkar, Tom Hehir, Keiya Hirashima, Geraud Krawezik, Francois Lanusse, Rudy Morel, Ruben Ohana, Liam Parker, Mariel Pettee, Jeff Shen, Kyunghyun Cho, Miles Cranmer, Shirley Ho
TL;DR
Walrus introduces a cross-domain foundation model for continuum dynamics by extending transformer architectures with stability and efficiency mechanisms tailored to heterogeneous 2D/3D physics data. Key contributions include patch jittering, 2D–3D data augmentation, adaptive-compute tokenization, and topology-aware sampling, enabling stable long-horizon forecasting across 19 physical scenarios. Empirical results show Walrus achieving substantial improvements over prior foundation models on both short- and long-horizon tasks and across diverse domains, with 3D performance highlighting practical viability for real-world simulations. The work emphasizes the importance of representational diversity in pretraining and proposes scalable training strategies to handle multi-resolution, multi-physics data, marking a substantive step toward robust, cross-domain physics emulation.
Abstract
Foundation models have transformed machine learning for language and vision, but achieving comparable impact in physical simulation remains a challenge. Data heterogeneity and unstable long-term dynamics inhibit learning from sufficiently diverse dynamics, while varying resolutions and dimensionalities challenge efficient training on modern hardware. Through empirical and theoretical analysis, we incorporate new approaches to mitigate these obstacles, including a harmonic-analysis-based stabilization method, load-balanced distributed 2D and 3D training strategies, and compute-adaptive tokenization. Using these tools, we develop Walrus, a transformer-based foundation model developed primarily for fluid-like continuum dynamics. Walrus is pretrained on nineteen diverse scenarios spanning astrophysics, geoscience, rheology, plasma physics, acoustics, and classical fluids. Experiments show that Walrus outperforms prior foundation models on both short and long term prediction horizons on downstream tasks and across the breadth of pretraining data, while ablation studies confirm the value of our contributions to forecast stability, training throughput, and transfer performance over conventional approaches. Code and weights are released for community use.
