Table of Contents
Fetching ...

A self-regulated convolutional neural network for classifying variable stars

Francisco Pérez-Galarce, Jorge Martínez-Palomera, Karim Pichara, Pablo Huijse, Márcio Catelan

TL;DR

The paper tackles data-shift and class-imbalance challenges in variable-star classification by coupling a self-regulated CNN with a physics-informed PELS-VAE to generate synthetic light curves conditioned on Gaia DR3 parameters. The classifier uses dual masks to learn from real and synthetic data, with synthetic samples injected according to epoch-specific policies and driven by BGMM sampling of physical parameters. Empirical results on OGLE and Gaia DR3-derived biases show improved robustness across loss functions, policies, signal-to-noise ratios, and sequence lengths, and they provide a framework for more reliable hyperparameter search. This approach enhances generalisation to unseen, underrepresented regions of the physical-parameter space, with practical implications for time-domain surveys and online classification pipelines.

Abstract

Over the last two decades, machine learning models have been widely applied and have proven effective in classifying variable stars, particularly with the adoption of deep learning architectures such as convolutional neural networks, recurrent neural networks, and transformer models. While these models have achieved high accuracy, they require high-quality, representative data and a large number of labelled samples for each star type to generalise well, which can be challenging in time-domain surveys. This challenge often leads to models learning and reinforcing biases inherent in the training data, an issue that is not easily detectable when validation is performed on subsamples from the same catalogue. The problem of biases in variable star data has been largely overlooked, and a definitive solution has yet to be established. In this paper, we propose a new approach to improve the reliability of classifiers in variable star classification by introducing a self-regulated training process. This process utilises synthetic samples generated by a physics-enhanced latent space variational autoencoder, incorporating six physical parameters from Gaia Data Release 3. Our method features a dynamic interaction between a classifier and a generative model, where the generative model produces ad-hoc synthetic light curves to reduce confusion during classifier training and populate underrepresented regions in the physical parameter space. Experiments conducted under various scenarios demonstrate that our self-regulated training approach outperforms traditional training methods for classifying variable stars on biased datasets, showing statistically significant improvements.

A self-regulated convolutional neural network for classifying variable stars

TL;DR

The paper tackles data-shift and class-imbalance challenges in variable-star classification by coupling a self-regulated CNN with a physics-informed PELS-VAE to generate synthetic light curves conditioned on Gaia DR3 parameters. The classifier uses dual masks to learn from real and synthetic data, with synthetic samples injected according to epoch-specific policies and driven by BGMM sampling of physical parameters. Empirical results on OGLE and Gaia DR3-derived biases show improved robustness across loss functions, policies, signal-to-noise ratios, and sequence lengths, and they provide a framework for more reliable hyperparameter search. This approach enhances generalisation to unseen, underrepresented regions of the physical-parameter space, with practical implications for time-domain surveys and online classification pipelines.

Abstract

Over the last two decades, machine learning models have been widely applied and have proven effective in classifying variable stars, particularly with the adoption of deep learning architectures such as convolutional neural networks, recurrent neural networks, and transformer models. While these models have achieved high accuracy, they require high-quality, representative data and a large number of labelled samples for each star type to generalise well, which can be challenging in time-domain surveys. This challenge often leads to models learning and reinforcing biases inherent in the training data, an issue that is not easily detectable when validation is performed on subsamples from the same catalogue. The problem of biases in variable star data has been largely overlooked, and a definitive solution has yet to be established. In this paper, we propose a new approach to improve the reliability of classifiers in variable star classification by introducing a self-regulated training process. This process utilises synthetic samples generated by a physics-enhanced latent space variational autoencoder, incorporating six physical parameters from Gaia Data Release 3. Our method features a dynamic interaction between a classifier and a generative model, where the generative model produces ad-hoc synthetic light curves to reduce confusion during classifier training and populate underrepresented regions in the physical parameter space. Experiments conducted under various scenarios demonstrate that our self-regulated training approach outperforms traditional training methods for classifying variable stars on biased datasets, showing statistically significant improvements.

Paper Structure

This paper contains 29 sections, 9 equations, 7 figures, 8 tables.

Figures (7)

  • Figure 1: Classifier model architecture. The input layer receives the $(\Delta t_i, \Delta m_i)_{i=1}^L$ representation. A sequence of 1D convolutional layers processes this input, where distinct sets of filters, managed by masks of weights, learn from real data ($\mathcal{L}_1$, depicted in light-blue) and synthetic data ($\mathcal{L}_2$, depicted in light-green), respectively. Each colour transformation corresponds to the filter operation it represents: light-blue for $\mathcal{L}_1$ and light-green for $\mathcal{L}_2$.
  • Figure 2: Diagram illustrating one epoch of the proposed training overview. The process involves conditional checks for generating new synthetic samples, sampling strategies for obtaining synthetic physical parameters, multiple-output regression for predicting latent space, and transformations for obtaining representations used by the classifier model.
  • Figure 3: Comparison of original and transformed samples from a GMM for a random variable that does not have physical meaning (Value). The histogram on the left shows the distribution of samples from the original distribution $(\mathcal{N} \left( 05, 1001 \right)$), and the histogram on the right shows the modified distribution using a $b = 0.6$, which accentuates the tails of the original distribution. Section \ref{['gmmsection']} dives into this distribution adaptation.
  • Figure 4: Physical parameters by class extracted from Gaia DR3. Each color represents a star type, as indicated in the legend. CEP represents Cepheids, DSCT represents delta Scuti stars, ECL represents eclipsing binaries, LPV represents long period variables, and RRLYR represents RR Lyrae stars.
  • Figure 5: Dataflow for synthetic light curves. The first column shows the light curves predicted by the PELS-VAE for these samples, i.e., the normalised and phased light curve outputted from the PELS-VAE. The second column presents a reverted light curve according to the procedure explained in Section \ref{['adapt']}. The third column illustrates a synthetic folded light curve (blue) with respect to a light curve in the training set (grey) with similar physical parameters to validate the process and visualise the impact of added noise. Finally, the last column provides the $(\Delta t, \Delta m)_{n=1}^L$ representation. Each row corresponds to a different variable star type (Cepheids, delta Scuti, eclipsing binaries, LPV and RR Lyrae).
  • ...and 2 more figures