Reinforcement Learning for Data-Driven Workflows in Radio Interferometry. I. Principal Demonstration in Calibration
Brian M. Kirk, Urvashi Rau, Ramyaa Ramyaa
TL;DR
This work tackles the automation of data reduction in radio interferometry by framing calibration and flagging as actions in a reinforcement learning environment. Using simulated VLA-like data, it defines a compact state space, six global actions, and a joint objective that balances residual Gaussian-approximation quality with runtime, via $action\text{-}value = 1\times 10^{6}\cdot EMD + runtime$. The results show that Q-learning can discover optimal single actions and, with RFI, optimal action sequences; neural networks and decision trees provide continuous and interpretable mappings from data state to actions. The study demonstrates data-driven automation, heuristic discovery, and tool diagnostics, and outlines a clear path toward applying these ideas to real data and including imaging stages for self-calibration and scalable automation.
Abstract
Radio interferometry is an observational technique used to study astrophysical phenomena. Data gathered by an interferometer requires substantial processing before astronomers can extract the scientific information from it. Data processing consists of a sequence of calibration and analysis procedures where choices must be made about the sequence of procedures as well as the specific configuration of the procedure itself. These choices are typically based on a combination of measurable data characteristics, an understanding of the instrument itself, an appreciation of the trade-offs between compute cost and accuracy, and a learned understanding of what is considered "best practice". A metric of absolute correctness is not always available and validity is often subject to human judgment. The underlying principles and software configurations to discern a reasonable workflow for a given dataset is the subject of training workshops for students and scientists. Our goal is to use objective metrics that quantify best practice, and numerically map out the decision space with respect to our metrics. With these objective metrics we demonstrate an automated, data-driven, decision system that is capable of sequencing the optimal action(s) for processing interferometric data. This paper introduces a simplified description of the principles behind interferometry and the procedures required for data processing. We highlight the issues with current automation approaches and propose our ideas for solving these bottlenecks. A prototype is demonstrated and the results are discussed.
