WaveCastNet: Rapid Wavefield Forecasting for Earthquake Early Warning via Deep Sequence to Sequence Learning
Dongwei Lyu, Rie Nakata, Pu Ren, Michael W. Mahoney, Arben Pitarka, Nori Nakata, N. Benjamin Erichson
TL;DR
WaveCastNet reframes earthquake ground-motion forecasting as spatiotemporal sequence-to-sequence prediction and introduces ConvLEM, a convolutional long expressive memory backbone, to capture multiscale spatial and temporal dynamics. The model supports both dense wavefield inputs and sparse sensor data, enabling fast, end-to-end forecasts of full waveforms without explicit source parameter estimation, and provides ensemble-based uncertainty estimates for operational decision-making. It demonstrates strong performance on synthetic point-source and finite-fault scenarios and shows zero-shot generalization to real-world data, with subsecond inference and robustness to noise and latency. Limitations include reduced accuracy for large magnitude, finite-fault earthquakes when trained only on point sources and the domain gap between synthetic and real data, motivating future work on broader finite-fault training, domain adaptation, and inference-velocity optimizations for real-time EEW deployment.
Abstract
We propose a new deep learning model, WaveCastNet, to forecast high-dimensional wavefields. WaveCastNet integrates a convolutional long expressive memory architecture into a sequence-to-sequence forecasting framework, enabling it to model long-term dependencies and multiscale patterns in both space and time. By sharing weights across spatial and temporal dimensions, WaveCastNet requires significantly fewer parameters than more resource-intensive models such as transformers, resulting in faster inference times. Crucially, WaveCastNet also generalizes better than transformers to rare and critical seismic scenarios, such as high-magnitude earthquakes. Here, we show the ability of the model to predict the intensity and timing of destructive ground motions in real time, using simulated data from the San Francisco Bay Area. Furthermore, we demonstrate its zero-shot capabilities by evaluating WaveCastNet on real earthquake data. Our approach does not require estimating earthquake magnitudes and epicenters, steps that are prone to error in conventional methods, nor does it rely on empirical ground-motion models, which often fail to capture strongly heterogeneous wave propagation effects.
