Adversarial Observations in Weather Forecasting
Erik Imgrund, Thorsten Eisenhofer, Konrad Rieck
TL;DR
This work demonstrates that AI-based weather forecasting with autoregressive diffusion, such as GenCast, is vulnerable to adversarial observations that can fabricate extreme weather or conceal real events while keeping perturbations within plausible noise bounds. It introduces a novel attack framework that approximates the diffusion inference across multiple noise levels, with a per-variable perturbation constraint, and shows superior performance over baselines across diverse locations and events. Using ERA5 data, the authors quantify that perturbations as small as $\epsilon$ can yield dramatic forecast deviations (e.g., wind up to $30.7\,\mathrm{m/s}$, temperature up $\sim$ $24.6$°C, precipitation up to $221\,\mathrm{mm}$ in 12 hours) and can obscure major historical events like Cyclone Amphan, the 2006 European heat wave, and Hurricane Katrina. The study also finds that simple statistical detection via a chi-square test for variance would rarely detect such attacks, underscoring the urgency of robust data-provenance and defense strategies before large-scale deployment of AI-based weather models.
Abstract
AI-based systems, such as Google's GenCast, have recently redefined the state of the art in weather forecasting, offering more accurate and timely predictions of both everyday weather and extreme events. While these systems are on the verge of replacing traditional meteorological methods, they also introduce new vulnerabilities into the forecasting process. In this paper, we investigate this threat and present a novel attack on autoregressive diffusion models, such as those used in GenCast, capable of manipulating weather forecasts and fabricating extreme events, including hurricanes, heat waves, and intense rainfall. The attack introduces subtle perturbations into weather observations that are statistically indistinguishable from natural noise and change less than 0.1% of the measurements - comparable to tampering with data from a single meteorological satellite. As modern forecasting integrates data from nearly a hundred satellites and many other sources operated by different countries, our findings highlight a critical security risk with the potential to cause large-scale disruptions and undermine public trust in weather prediction.
