The Simons Observatory: Alarms and Detector Quality Monitoring
David V. Nguyen, Sanah Bhimani, Nicholas Galitzki, Brian J. Koopman, Jack Lashner, Laura Newburgh, Max Silva-Feaver, Kyohei Yamada
TL;DR
The paper addresses the need for real-time alarms and data-quality monitoring in the Simons Observatory (SO) with more than 60,000 detectors across four telescopes. It details an integrated alarm system within the Observatory Control System (OCS) that uses InfluxDB, Grafana, Slack, and the campana service to monitor both housekeeping and detector-quality metrics, including architecture, data sources, and deployment experiences. Commissioning results demonstrate the system can manage hundreds of alert rules, support alert silencing, and promptly flag issues such as a helium leak in the PTC system, enabling rapid response and safeguarding observing time. The work presents a scalable, extensible framework suitable for SO expansions to UK/Japan and the Advanced Simons Observatory (ASO), with practical implications for robust, multi-telescope operations and data-quality assurance.
Abstract
The Simons Observatory (SO) is a group of modern telescopes dedicated to observing the polarized cosmic microwave background (CMB), transients, and more. The Observatory consists of four telescopes and instruments, with over 60,000 superconducting detectors in total, located at ~5,200 m altitude in the Atacama Desert of Chile. During observations, it is important to ensure the detectors, telescope platforms, calibration and receiver hardware, and site hardware are within operational bounds. To facilitate rapid response when problems arise with any part of the system, it is essential that alerts are generated and distributed to appropriate personnel if components exceed these bounds. Similarly, alerts are generated if the quality of the data has become degraded. In this paper, we describe the SO alarm system we developed within the larger Observatory Control System (OCS) framework, including the data sources, alert architecture, and implementation. We also present results from deploying the alarm system during the commissioning of the SO telescopes and receivers.
