Table of Contents
Fetching ...

Short-Term Solar Irradiance Forecasting Under Data Transmission Constraints

Joshua Edward Hammond, Ricardo A. Lara Orozco, Michael Baldea, Brian A. Korgel

TL;DR

This work tackles near-term solar irradiance forecasting under data transmission constraints by proposing a data-parsimonious CNN-LSTM that uses scalar sky-camera features and an optional noise input to capture unmeasured disturbances. The novelty lies in predicting the deviation from a long-term baseline (the persistence of cloudiness, POC) rather than irradiance itself, effectively de-trending the problem and improving forecast accuracy. The method employs three rolling training steps to optimize time representations, input sequence length, and feature importance, with a permutation-based analysis guiding feature selection. Empirically, the final model achieves about $75$ W/m$^2$ MAE, substantially better than the POC baseline of $134.35$ W/m$^2$, while requiring orders of magnitude less data than end-to-end image-based approaches, highlighting its practicality for bandwidth-limited, remote solar sites.

Abstract

We report a data-parsimonious machine learning model for short-term forecasting of solar irradiance. The model inputs include sky camera images that are reduced to scalar features to meet data transmission constraints. The output irradiance values are transformed to focus on unknown short-term dynamics. Inspired by control theory, a noise input is used to reflect unmeasured variables and is shown to improve model predictions, often considerably. Five years of data from the NREL Solar Radiation Research Laboratory were used to create three rolling train-validate sets and determine the best representations for time, the optimal span of input measurements, and the most impactful model input data (features). For the chosen test data, the model achieves a mean absolute error of 74.34 $W/m^2$ compared to a baseline 134.35 $W/m^2$ using the persistence of cloudiness model.

Short-Term Solar Irradiance Forecasting Under Data Transmission Constraints

TL;DR

This work tackles near-term solar irradiance forecasting under data transmission constraints by proposing a data-parsimonious CNN-LSTM that uses scalar sky-camera features and an optional noise input to capture unmeasured disturbances. The novelty lies in predicting the deviation from a long-term baseline (the persistence of cloudiness, POC) rather than irradiance itself, effectively de-trending the problem and improving forecast accuracy. The method employs three rolling training steps to optimize time representations, input sequence length, and feature importance, with a permutation-based analysis guiding feature selection. Empirically, the final model achieves about W/m MAE, substantially better than the POC baseline of W/m, while requiring orders of magnitude less data than end-to-end image-based approaches, highlighting its practicality for bandwidth-limited, remote solar sites.

Abstract

We report a data-parsimonious machine learning model for short-term forecasting of solar irradiance. The model inputs include sky camera images that are reduced to scalar features to meet data transmission constraints. The output irradiance values are transformed to focus on unknown short-term dynamics. Inspired by control theory, a noise input is used to reflect unmeasured variables and is shown to improve model predictions, often considerably. Five years of data from the NREL Solar Radiation Research Laboratory were used to create three rolling train-validate sets and determine the best representations for time, the optimal span of input measurements, and the most impactful model input data (features). For the chosen test data, the model achieves a mean absolute error of 74.34 compared to a baseline 134.35 using the persistence of cloudiness model.
Paper Structure (15 sections, 17 equations, 12 figures, 5 tables)

This paper contains 15 sections, 17 equations, 12 figures, 5 tables.

Figures (12)

  • Figure 1: CNN-LSTM Model Structure with an optional noise model where random numbers drawn from a Gaussian distribution to account for unmeasured variables and disturbances. Dimensions are shown in parentheses.
  • Figure 2: Irradiance measurements and a subset of sky-camera images from May 18, 2022 at the NREL SRRL BMS. Figure created by the authors based on data collected from stoffel1981. Letters denote the moments that the images were collected.
  • Figure 3: Validation Mean Absolute Error (W/m$^2$) across all predicted points ($t+10$ to $t+120$) for each model trained during Step 1 using year 3 as the test set. In this graphic, each row represents a different combination for time representation and each column represents a method for irradiance representation.
  • Figure 4: Probability density functions of the future irradiance representations. Note that the relative irradiance representations are much more tightly distributed around the center likely facilitating easier recognition of patterns which deviate from the present conditions.
  • Figure 5: FSS for a model trained on each future irradiance representation.
  • ...and 7 more figures