Machine Learning Based Identification of Solar Disk and Plages in Kodaikanal Solar Observatory Historical Suncharts
Dibya Kirti Mishra, Subhamoy Chatterjee, Bibhuti Kumar Jha, Hemapriya Raju, Aditya Priyadarshi, Manjunath Hegde, Srinjana Routh, Dipankar Banerjee, M. Saleem Khan
TL;DR
This paper addresses the challenge of extracting plages from the historical hand-drawn suncharts of KoSO by developing a two-stage CNN-based (U-Net) pipeline for disk detection and plage segmentation. Disk geometry is refined with Canny edge detection and Hough transforms, while plages are detected on 2048×2048 suncharts using a patch-based U-Net with a ResNet-34 encoder trained on ground-truth masks annotated in CVAT, reaching $\mathrm{IoU} \approx 0.8$. The resulting plage masks yield time-latitude diagrams and a plage-area series that agree well with Ca II K full-disk observations ($r = 0.80$, $\rho = 0.85$), allowing the construction of a composite dataset that fills historical data gaps. Limitations include non-detections in pre-1916 charts due to grid differences and annotation peculiarities in 1990. Future work aims to generalize the method to other features, harmonize the training data, and publish the data for community use.
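The IoU figure quoted above is the ratio of the overlap between a predicted mask and its ground-truth mask to their combined area. A minimal sketch of that metric follows, using only the Python standard library. The function name `iou`, the nested-list mask representation, and the convention that two empty masks score 1.0 are assumptions made for illustration and are not taken from the paper.

```python
def iou(pred, truth):
    """Intersection over Union of two binary masks.

    Both masks are nested lists of 0/1 values with identical shape
    (an illustrative representation, not the paper's data format).
    """
    intersection = 0
    union = 0
    for row_pred, row_truth in zip(pred, truth):
        for p, t in zip(row_pred, row_truth):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Assumed convention: two empty masks count as a perfect match.
    if union == 0:
        return 1.0
    return intersection / union


# Toy 3x4 masks: the prediction is shifted one column left of the truth.
pred_mask = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
truth_mask = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
score = iou(pred_mask, truth_mask)  # 2 overlapping pixels out of 6 in the union
```

In this toy case the masks share 2 pixels and together cover 6, so the score is 1/3. A value near 0.8 therefore means the predicted and annotated plage regions overlap over most of their combined area.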
Abstract
Kodaikanal Solar Observatory (KoSO) is one of the oldest solar observatories, possessing an archive of multi-wavelength solar observations, including white light, Ca II K, and H-alpha images, spanning over a century. In addition to these observations, KoSO has preserved hand-drawn suncharts (1904-2022), on which various solar features such as sunspots, plages, filaments, and prominences are marked on the Stonyhurst grid with distinct colour coding. In this study, we present the first comprehensive result covering the entire data set from these suncharts, using supervised machine learning models, Convolutional Neural Networks (CNNs), first to identify the solar disks in the charts (1909-2007) and second to identify the plages, spanning 9 solar cycles (1916-2007). We train the CNNs with manually identified solar disks and plages. We first detect the solar limb and the North-South line in the suncharts, which enables the extraction of the disk centre coordinates, radius, and P-angle. Following that, we use a similar CNN architecture to achieve accurate image segmentation for the identification of plages. We compare plage areas derived from the suncharts with those obtained from Ca II K full-disk observations and find good agreement, which demonstrates the potential of such an ML technique for historical data. The results of this study further demonstrate that sunchart data can fill the existing gaps in the KoSO multi-wavelength observations and contribute toward constructing a composite series over the last century.
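To illustrate how a detected limb and North-South line translate into a disk centre, a radius, and an orientation angle, here is a minimal geometric sketch. It is not the method of the paper: it swaps in a circle through three limb points for the fitting step, and it measures the tilt of the North-South line from the image vertical. The function names, the pixel coordinate convention (x to the right, y downward), and the sign of the angle (positive when north tilts toward the left of the image) are all assumptions. Mapping this tilt to the solar P-angle depends on the orientation of the chart.

```python
import math


def circle_from_three_points(p1, p2, p3):
    """Centre (cx, cy) and radius of the circle through three limb points."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    d = 2.0 * (x1 * (y2 - y3) + x2 * (y3 - y1) + x3 * (y1 - y2))
    if abs(d) < 1e-12:
        raise ValueError("limb points are collinear; no unique circle")
    s1 = x1 * x1 + y1 * y1
    s2 = x2 * x2 + y2 * y2
    s3 = x3 * x3 + y3 * y3
    cx = (s1 * (y2 - y3) + s2 * (y3 - y1) + s3 * (y1 - y2)) / d
    cy = (s1 * (x3 - x2) + s2 * (x1 - x3) + s3 * (x2 - x1)) / d
    radius = math.hypot(x1 - cx, y1 - cy)
    return cx, cy, radius


def tilt_from_vertical_deg(north, south):
    """Angle of the south-to-north direction relative to image-up, in degrees.

    Assumed convention: pixel coordinates with y increasing downward,
    positive angle when the north end leans toward the left of the image.
    """
    dx = north[0] - south[0]
    dy = north[1] - south[1]
    return math.degrees(math.atan2(-dx, -dy))


# Hypothetical limb points on a 2048x2048 chart (values chosen for illustration).
cx, cy, radius = circle_from_three_points((2024, 1024), (1024, 2024), (24, 1024))
# Hypothetical endpoints of a North-South line that runs straight up the image.
tilt = tilt_from_vertical_deg(north=(1024, 24), south=(1024, 2024))
```

With these illustrative points the circle is centred on the middle of the chart with a radius of 1000 pixels, and the tilt is zero because the line is vertical. A real chart would need many limb points and a robust fit, since three points are sensitive to drawing and scanning errors.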
