Table of Contents
Fetching ...

Siberian radioheliograph image classification using ensemble of CLIP, EfficientNet and CatBoost models

Yaroslav Egorov

TL;DR

The paper addresses the challenge of automatically assessing SRH image quality to filter out low-quality solar radio images. It introduces a multi-model ensemble that combines zero-shot CLIP labeling, fine-tuned EfficientNet, and CatBoost, with an FFNN ensemble to produce final GOOD/BAD classifications. The ensemble achieves the best performance (≈95% accuracy) and is deployed as a daily online service with a REST API, enabling seamless integration into SRH calibration and CLEAN-map workflows. This work enhances the reliability of SRH data products and supports more robust solar physics analyses. The approach demonstrates the value of combining multimodal embeddings and ensemble learning for automated quality control in radio solar imaging.

Abstract

The Siberian Radioheliograph (SRH) is a ground-based radio interferometer in Irkutsk, Russia, designed for high-resolution solar observations in the microwave range. It can observe dynamic solar events with spatial resolutions of 7-30 arcseconds and temporal resolution up to 0.1 seconds. Generating solar radio images from the Siberian Radioheliograph (SRH) is a multi-step calibration process that corrects instrumental and atmospheric distortions, using redundancy-based calibration with both adjacent and non-adjacent antenna pairs to address phase and amplitude errors in visibility data. The CLEAN algorithm is then applied to deconvolve the point spread function, reduce sidelobes, and enhance the visibility of solar features, resulting in higher quality and more reliable images. While the calibration process generally improves image quality, it can sometimes result in noisy or spatially shifted images that are not suitable for scientific use. We developed a deep learning approach for automatic image quality classification. The training dataset was prepared using a zero-shot CLIP model and further validated manually. We evaluated four different models: a fine-tuned EfficientNet, two CatBoost variants using embeddings from CLIP and EfficientNet, and an Ensemble model that combined predictions from all three individual models. The Ensemble model achieved the best performance. The SRH daily image classification service has been created and is available online at https://forecasting.iszf.irk.ru/srh along with an API offering IDL and Python examples. Integration of Ensemble model into SRH image generating and calibration workflow can improve image reliability and reduces low-quality entries in SRH data catalog, enhancing solar research outcomes.

Siberian radioheliograph image classification using ensemble of CLIP, EfficientNet and CatBoost models

TL;DR

The paper addresses the challenge of automatically assessing SRH image quality to filter out low-quality solar radio images. It introduces a multi-model ensemble that combines zero-shot CLIP labeling, fine-tuned EfficientNet, and CatBoost, with an FFNN ensemble to produce final GOOD/BAD classifications. The ensemble achieves the best performance (≈95% accuracy) and is deployed as a daily online service with a REST API, enabling seamless integration into SRH calibration and CLEAN-map workflows. This work enhances the reliability of SRH data products and supports more robust solar physics analyses. The approach demonstrates the value of combining multimodal embeddings and ensemble learning for automated quality control in radio solar imaging.

Abstract

The Siberian Radioheliograph (SRH) is a ground-based radio interferometer in Irkutsk, Russia, designed for high-resolution solar observations in the microwave range. It can observe dynamic solar events with spatial resolutions of 7-30 arcseconds and temporal resolution up to 0.1 seconds. Generating solar radio images from the Siberian Radioheliograph (SRH) is a multi-step calibration process that corrects instrumental and atmospheric distortions, using redundancy-based calibration with both adjacent and non-adjacent antenna pairs to address phase and amplitude errors in visibility data. The CLEAN algorithm is then applied to deconvolve the point spread function, reduce sidelobes, and enhance the visibility of solar features, resulting in higher quality and more reliable images. While the calibration process generally improves image quality, it can sometimes result in noisy or spatially shifted images that are not suitable for scientific use. We developed a deep learning approach for automatic image quality classification. The training dataset was prepared using a zero-shot CLIP model and further validated manually. We evaluated four different models: a fine-tuned EfficientNet, two CatBoost variants using embeddings from CLIP and EfficientNet, and an Ensemble model that combined predictions from all three individual models. The Ensemble model achieved the best performance. The SRH daily image classification service has been created and is available online at https://forecasting.iszf.irk.ru/srh along with an API offering IDL and Python examples. Integration of Ensemble model into SRH image generating and calibration workflow can improve image reliability and reduces low-quality entries in SRH data catalog, enhancing solar research outcomes.

Paper Structure

This paper contains 13 sections, 6 equations, 7 figures.

Figures (7)

  • Figure 1: The images from the SRH catalog represent typical cases used for classification, with "BAD" examples showing poor quality (top panel), and "GOOD" examples demonstrating high-quality (bottom panel)
  • Figure 2: A multimodal architecture of CLIP for image and text encoding. The model consists of separate encoders for images and text, mapping them into a shared embedding space.
  • Figure 3: Architecture of the Ensemble Classifier. The model takes outputs from 3 base models as input and processes them through a two-layer feedforward neural network with ReLU activation and dropout regularization (dropout rate = 0.5). A final softmax layer produces class probabilities for two output categories: "Bad" and "Good".
  • Figure 4: Illustration for processing and classifying solar radio images using a combination of image preprocessing, CLIP-based classification, transfer learning with EfficientNet, CatBoost models, and ensemble techniques. The workflow includes data preparation, manual validation, model training, embedding extraction, and ensemble classification to improve performance.
  • Figure 5: Performance comparison of different models on classification tasks. The results show the accuracy (%) and error rates (%) for four different approaches: (1) CLIP, (2) Fine-tuned EfficientNet, (3) CLIP embeddings + CatBoost, and (4) EfficientNet embeddings + CatBoost. The models are evaluated on two classes: "Good" and "Bad". Each model's prediction accuracy and confusion matrix percentages are displayed, illustrating the superior performance of ensemble methods in classification accuracy
  • ...and 2 more figures