Detecting Spatiotemporal b-Value Anomalies with a Progressive Deep Learning Architecture
Jonas Köhler, Wei Li, Johannes Faber, Georg Rümpker, Nishtha Srivastava
TL;DR
This paper develops a methodological framework to detect spatiotemporal anomalies in evolving daily $b$-value fields over Japan by constructing fine‑scale $b$-value maps and framing anomaly detection as a binary classification on 512 × 32 × 32 blocks. A hybrid CNN–TCN architecture processes spatial patterns and temporal dynamics, while a progressive time‑forward training scheme mitigates nonstationarity and prevents future information leakage. Internal validation across a time-forward catalog demonstrates how data density, sample balance, and large aftershock sequences (notably the 2011 Tōhoku event) shape model behavior and performance, with MAE-based configurations generally performing best. The work emphasizes careful interpretation of anomaly scores, attributes model behavior to catalog characteristics, and outlines a path toward adapting the approach for rate-like forecasts and broader seismic regimes. Overall, it provides a structured, reproducible framework for exploring how spatiotemporal $b$-value evolution relates to large earthquakes while highlighting methodological considerations and limitations.
Abstract
Identifying systematic patterns in seismicity that precede large earthquakes remains a central challenge in statistical seismology. In this work, we present a methodological framework for detecting spatiotemporal anomalies in seismicity using the evolution of gridded b-values. Focusing on the Japanese subduction zone, we construct daily b-value fields on a fine spatial grid by aggregating local seismicity over moving time windows, yielding a continuous 2+1D representation of seismic-state evolution. We formulate the problem as a binary classification task in which spatiotemporal blocks extracted from these $b$-value fields are labeled according to the occurrence of a target earthquake with \Mw $\geq 5$ in the central region within the next day. To model this data, we introduce a hybrid deep-learning architecture that combines a spatial convolutional encoder with a temporal convolutional network, enabling joint learning of spatial structure and temporal dynamics. A progressive meta-epoch training scheme is employed, in which the model is iteratively updated using a time-forward strategy that mirrors operational deployment and mitigates issues related to nonstationarity. This paper is strictly methodological in scope. It describes the construction of b-value fields, the spatiotemporal sampling strategy, the network architecture, and the progressive training and internal validation framework used for model development and parameter selection.
