Variability in Performance of a Machine-Learning Seismicity Catalog: Central Italy, 2016-2017
Jaehong Chung, Yifan Yu, Lauro Chiaraluce, Maddalena Michele, Gregory C. Beroza
TL;DR
The study tackles how machine-learning–based seismic catalogs affect detection performance across a network, not just in aggregate. It introduces a probability-based magnitude-of-completeness (PMC) framework that converts station-level detection probabilities into spatial maps of $M_c(\mathbf{x})$, using a logistic regression model for P- and S-waves and a minimum eight-station criterion to define network detectability. Results show broad reductions in $M_c(\mathbf{x})$ with ML catalogs, but with greater across-station variability, and pronounced gains in densely instrumented regions—especially for S-waves—highlighting both the benefits and the limits of ML-based monitoring. The framework provides a practical tool for evaluating catalog quality, guiding network design, and informing seismic-risk assessments with spatially resolved detectability metrics.
Abstract
Machine learning (ML) catalogs contain many more earthquakes than routine catalogs, but their performance in phase picking and earthquake detection has not been fully evaluated. We develop station-level detection probabilities using logistic regression and combine them across a seismic network to compute spatial magnitude-of-completeness fields. We apply this approach to two catalogs from the 2016-2017 Central Italy sequence that were constructed from the same seismic network, one routine and one ML based. At the station level, the ML picker increases detection sensitivity by identifying smaller magnitude events and detecting earthquakes at greater distances. Spatially, the magnitude-of-completeness decreases substantially, with median values shifting from 1.6 to 0.5 for P waves and from 1.7 to 0.5 for S waves. However, the ML catalog also shows greater variability in station-level performance than the routine catalog. These results demonstrate that ML-based improvements in detectability are widespread but spatially non-uniform, highlighting their benefits, their limitations, and the potential for further improvements.
