BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics
Hao Wu, Xingjian Shi, Ziyue Huang, Penghao Zhao, Wei Xiong, Jinbao Xue, Yangyu Tao, Xiaomeng Huang, Weiyan Wang
TL;DR
BeamVQ addresses the mismatch between data-driven space-time forecasts and physical laws by introducing a self-training loop guided by physics-aware metrics. It converts any encoder-decoder forecast into a probabilistic model using a discrete code bank and employs beam search to generate multiple candidate futures, which are filtered by physics scores and used to augment training data. Across five benchmarks and ten backbones, BeamVQ yields substantial gains in both statistical accuracy (lower MSE) and physical alignment (divergence, TKE, energy spectrum), with notable improvements in long-term forecasting. The method is flexible, scalable, and ready to integrate with additional physical constraints and larger datasets to further enhance real-world predictive reliability.
Abstract
Data-driven deep learning has emerged as the new paradigm to model complex physical space-time systems. These data-driven methods learn patterns by optimizing statistical metrics and tend to overlook the adherence to physical laws, unlike traditional model-driven numerical methods. Thus, they often generate predictions that are not physically realistic. On the other hand, by sampling a large amount of high quality predictions from a data-driven model, some predictions will be more physically plausible than the others and closer to what will happen in the future. Based on this observation, we propose \emph{Beam search by Vector Quantization} (BeamVQ) to enhance the physical alignment of data-driven space-time forecasting models. The key of BeamVQ is to train model on self-generated samples filtered with physics-aware metrics. To be flexibly support different backbone architectures, BeamVQ leverages a code bank to transform any encoder-decoder model to the continuous state space into discrete codes. Afterwards, it iteratively employs beam search to sample high-quality sequences, retains those with the highest physics-aware scores, and trains model on the new dataset. Comprehensive experiments show that BeamVQ not only gave an average statistical skill score boost for more than 32% for ten backbones on five datasets, but also significantly enhances physics-aware metrics.
