Comprehensive assessment, review, and comparison of AI models for solar irradiance prediction based on different time/estimation intervals

Bamisile, Olusola; Cai, Dongsheng; Oluwasanmi, Ariyo; Ejiyi, Chukwuebuka; Ukwuoma, Chiagoziem C.; Ojo, Oluwasegun; Mukhtar, Mustapha; Huang, Qi

doi:10.1038/s41598-022-13652-w

Download PDF

Article
Open access
Published: 10 June 2022

Comprehensive assessment, review, and comparison of AI models for solar irradiance prediction based on different time/estimation intervals

Olusola Bamisile¹,
Dongsheng Cai¹,
Ariyo Oluwasanmi²,
Chukwuebuka Ejiyi²,
Chiagoziem C. Ukwuoma²,
Oluwasegun Ojo^3,4,
Mustapha Mukhtar⁵ &
…
Qi Huang¹

Scientific Reports volume 12, Article number: 9644 (2022) Cite this article

4611 Accesses
12 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Solar energy-based technologies have developed rapidly in recent years, however, the inability to appropriately estimate solar energy resources is still a major drawback for these technologies. In this study, eight different artificial intelligence (AI) models namely; convolutional neural network (CNN), artificial neural network (ANN), long short-term memory recurrent model (LSTM), eXtreme gradient boost algorithm (XG Boost), multiple linear regression (MLR), polynomial regression (PLR), decision tree regression (DTR), and random forest regression (RFR) are designed and compared for solar irradiance prediction. Additionally, two hybrid deep neural network models (ANN-CNN and CNN-LSTM-ANN) are developed in this study for the same task. This study is novel as each of the AI models developed was used to estimate solar irradiance considering different timesteps (hourly, every minute, and daily average). Also, different solar irradiance datasets (from six countries in Africa) measured with various instruments were used to train/test the AI models. With the aim to check if there is a universal AI model for solar irradiance estimation in developing countries, the results of this study show that various AI models are suitable for different solar irradiance estimation tasks. However, XG boost has a consistently high performance for all the case studies and is the best model for 10 of the 13 case studies considered in this paper. The result of this study also shows that the prediction of hourly solar irradiance is more accurate for the models when compared to daily average and minutes timestep. The specific performance of each model for all the case studies is explicated in the paper.

A multilayer perceptron neural network approach for optimizing solar irradiance forecasting in Central Africa with meteorological insights

Article Open access 12 February 2024

Solar power forecasting beneath diverse weather conditions using GD and LM-artificial neural networks

Article Open access 25 May 2023

A novel GBDT-BiLSTM hybrid model on improving day-ahead photovoltaic prediction

Article Open access 13 September 2023

Introduction

Nowadays, the world is almost impossible to envisage without its interrelationship and dependence on electricity¹. This electricity is mainly produced with fossil fuels and based on statistics, the global primary energy demand will increase by over 59% between 2002 and 2030 ². However, the evidential environmental impact of the current (fossil fuels) energy resources, as well as the need to reduce its climate change effect, led to the development of renewable energy sources (RES) ³. These RES have experienced significant growth in recent decades and they are projected to have as much as 39% share in global electricity generation by 2050 ⁴. Solar energy is a sustainable, clean, and extremely abundant RES ⁵ that poses a very low risk to its immediate environment and the world at large. The critical investigation into the accessibility and availability of renewable energy (RE) resources has witnessed a continuous evolvement, especially in developing countries. There is a rapid and consistent escalation in electricity demand in many developing countries as they strive toward advanced technological implementation and globalization ⁶. Therefore, it is imperative to initiate and encourage RES development in these regions.

Solar radiation influences agricultural production, atmospheric circulation, hydrological processes, public health as well as ecological services, and the comprehensive knowledge of this parameter at any location is important to its environmental sustainability and economic potential ⁷. Moreover, solar radiation is a crucial and decisive parameter for solar energy management and generation. Information about global solar radiation is also significant in many applications including; RE-usage, hydrology, and meteorology ⁸. The recent efforts and push for the replacement of fossil fuels with RES have made solar radiation a more important meteorological variable used to simulate and measure RE potential in any location. Unlike other meteorological parameters like relative humidity, temperature, and sunshine duration, the observation stations for solar radiation measurement are not globally available. This is due to the complicated measurement techniques and relatively high cost. Therefore, developing an accurate method or model to predict solar radiation is very important ⁹.

Typically, the models for solar radiation prediction or estimation can be classified into empirical, statistical, physical, and machine learning models ⁹. While physical models such as sky-image-based models explore the mechanism between solar radiation and other meteorological parameters ¹⁰, empirical models are aimed at developing a linear or non-linear regression equation for solar radiation estimation ¹¹. Statistical models such as the autoregressive moving-average model (ARIMA), are developed based on statistical correlation ¹². In recent years, artificial intelligence (AI) models have been used for better solar radiation prediction. The ability of these models to simulate nonlinear and complex relationship mapping as well as the capability to learn and extract meaning features from the input data via backpropagation and parameter update make it more desirable for this task ¹³.

The adoption of AI (machine learning and deep learning) models for the prediction or estimation of solar radiation have proven in literature to have a wider application and higher accuracy in comparison to other models. These models can accurately moderate the long-term, medium-term, and short-term prediction of solar radiation ¹⁴. Gurel et al. ¹⁵ presented the assessment of time series (Holt-Winters), machine learning (feed-forward neural network), empirical models (3 Angstrom-type models), and response surface methodology (RSM) for global solar radiation. Meteorological data obtained between 2008 and 2018 for four provinces in Turkey were used to train, validate, and test the models. Based on the performance evaluation of their models, the R² varied between 0.952 and 0.993 while the artificial neural network was concluded to present the best results ¹⁵. Furthermore, a review of some of the most recent literatures on solar radiation prediction with different models and methods is summarized in Table 1. This table highlights the type of model, case study, the aim of the study, and the performance summary of the models in different works of literature. Based on the articles reviewed in this table, the use of both unsupervised (machine) learning and supervised learning algorithms has been proposed for the forecast of solar irradiance. Therefore, the comparison of these models is one of the aims of this present study. Also, none of the proposed models were able to give a 100% accurate prediction/forecast of solar radiation in all the various locations. Hence the consistent recommendation stated in most of these research articles that future studies are required in this research domain to develop more accurate models for solar radiation forecasting.

Table 1 Summary of recent literature on solar radiation forecast/prediction.

Full size table

The expansion of solar energy-based technologies and applications will continue ⁴⁰. Therefore, the reliable estimation of solar radiation including its hourly, daily average, monthly average, annual, ⁴¹ and seasonal variability is of paramount importance for the estimation of solar energy capacity and potential ⁴². As mentioned earlier, the high cost and technological complexity attached to the measurement of solar radiation makes it a more difficult task in many meteorological stations. For example, there are 1798 meteorological stations in Turkey in the year 2020 and only 129 of the stations are capable of measuring solar radiation ⁴³. Also, out of the 756 meteorological stations in China, only 122 of them have the capability to measure solar radiation ⁴⁴. These further stresses the importance of solar radiation estimation. In most existing works of literature on solar radiation prediction, the prediction was done with different models. However, these models were compared based on the similarity of the class. Also, most models are used to predict a particular type of data type with a specific timestep. This has raised research questions about the adoption of different models for the various dataset, timesteps, and locations. Furthermore, developing countries (especially Africa) have enormous solar energy potential, however, the development of solar-based technologies has been very slow due to many reasons. One of which is inadequacies in the measurements of solar radiation.

Therefore, in this paper, we seek to further the knowledge of literature in this field by comparing different artificial intelligence (AI) models for solar radiation estimations. Eight different AI models namely; convolutional neural network (CNN), artificial neural network (ANN), long short-term memory recurrent model (LSTM), eXtreme gradient boost algorithm (XG Boost), multiple linear regression (MLR), polynomial regression (PLR), decision tree regression (DTR), and random forest regression (RFR) are compared for solar irradiance forecast. Additionally, two hybrid deep neural network models are developed in this study for this task. These models are a combination of two or more deep neural network models namely; ANN-CNN and CNN-LSTM-ANN. In comparison to existing techniques where a specific timestep is adopted, in this study, the models developed will be used to estimate the hourly, every minute, and daily average solar radiation. Also, different datasets such as typical meteorological year (TMY), surface radiation data set for heliostats (SARAH), and The World Bank solar radiation measurement data (WB-ESMAP) dataset are used to test the models developed in this paper. In comparison to literature where a specific solar irradiance data set is used, the research further contributes to literature by considering different measured solar irradiance datasets. These datasets include; global beam direct solar irradiance (GSR), diffused solar irradiance (DSR), daily average solar radiation flux at the surface normal to the direction of the sun (DNI), global horizontal irradiance measured from silicon pyranometer (GHI_Sil), diffused horizontal irradiance from rotating shadowband irradiometer (DHI_RSI), and global horizontal irradiance measured from thermopile pyranometer (GHI_pyr). These are useful for solar photovoltaics, solar thermal, solar heliostat, solar rooftop, and other solar technology applications.

This study seeks to determine the AI model that has a consistent accurate predictive performance for solar irradiance measured with various methods in different locations. Therefore, the datasets used in this study have been collected from 13 specific locations across six African countries. The viability of different AI models, when used for solar radiation prediction in different locations and considering various datasets as well as timesteps, is analysed in this study. One of the research questions that this study seeks to address is the possible sovereignty of an AI model for solar radiation estimation tasks considering differences in location, timestep, and dataset. While developing (African) countries has been used as the case study for the implementation of the AI algorithms developed in this study, the applicability of this models is not limited to developing countries only. They can be use in developed countries also however, some of the training parameters may require adjustments for the supervised AI algorithm. The rest of the article is organized as follows; a brief introduction to all the models considered in this study as well as the model development are explained in “Machine learning and deep learning algorithms” and “Data acquisition and preparation” sections. The performances of the models are presented in “Results” section and a brief discussion of these performances is stated in “Brief summary and discussion” section. The entire article is concluded in “Conclusions” section.

Machine learning and deep learning algorithms

Recent research have focused on forecasting renewable energy resources ^45,46,47, because of the growth in global RES and the integration of such sources into the electrical grid throughout the world. Recently, the projection of renewable energy production, notably wind and solar energy, has received considerable attention due to its considerable influence on operating and managing power management choices. Precise forecasts for the production of renewable energy-based systems are essential to ensure the continued dependability of the grid and to decrease energy market and energy systems risks/costs. Due to nature, the energy generated by solar and wind energies will always be unstable. Hence, the need to adopt sophisticated methodologies for the forecast of energy systems’ production. The methods adopted and compared in this study for solar energy resources forecast may be divided into 4 categories: physical methods, statistical models, techniques, and hybrid ways of artificial intelligence ⁴⁸. These are introduced in the following subsection.

Random forest regression

One of the most common machine learning methods is a random forest (RF) algorithm ⁴⁹. This is a controlled approach that employs a regression method for learning. The learning approach integrates various machine learning algorithms in order to generate predictions that are more accurate than a single model. In the course of training and determining the mean class of the classes, a random forest operates by building many decision trees as a forecast for all the trees ^50,51. Creating several trees for different subsets of the data points balances the prevalent overfitting problem, minimizes variance, and ensures improved accuracy. The RF algorithm is shown in Algorithm 1 while a sample of the RF tree is illustrated in Fig. 1.

Algorithm 1 Random Forest Algorithm
Start
Select from the training set a random k data point
Construct a decision tree for the k data points
Select N of the trees you would want to construct
Repeat
Steps 1 and 2
Make a prediction of the values of y for the data point for each of your N-trees and assign the new data point to the average across the whole number of y-values anticipated
End

The RF algorithm's predictive value is provided by the mathematical equation ⁵²;

$$ \hat{Y} = \frac{1}{N}\mathop \sum \limits_{n = 1}^{N} T_{n} \left( X \right) $$

(1)

where $Y$'s mean values are from $n, N, and T_{n} \left( x \right).$ Input parameters in $X$ indicate the number of random forest decision trees in N. The equation specifies the average number of $T_{n} , n = 1,2,...,N$ decision trees given the input $X$ in order to provide a solid forecast.

With the RF-Method, forecasts can be obtained and forecasting parameters identified (which are related to the response) via RF's integrated measurement of variable importance. This may also be taken into consideration and enhanced prognostics can be produced. Specifically, RF is adopted in this study for solar radiation forecast due to its use in existing works of literatures ⁵³. For instance, in three distinct sites with varied API conditions in China, Sun et al. ⁵⁴ utilize the random forest to estimate solar radiation given a single, accessible meteorological variable and air pollution index.

Polynomial regression

Polynomial regression is a specialized linear regression in which the data (having a curvilinear connection between the goal and the independent variables) are multinomially equated. Polynomial ensures a proper approximation of dependent and independent variables across a wide range of curvatures. The value of the target variable does not vary uniformly with regard to the predictor in a curvilinear relationship (s). The linear regression equation (Eq. (2)) with one predictor is transformed to polynomial equation of degree n in polynomial regression as Eq. (3).

$$ Y = \theta_{0} + \theta_{1} x $$

(2)

where $Y$ is the goal, $x$ is the predictor, $\theta_{0}$ is the bias, and $\theta_{1}$ is the weight of the equation of regression.

$$ Y = \theta_{0} + \theta_{1} x + \theta_{2} x^{2} + \theta_{3} x^{3} + \ldots + \theta_{n} x^{n} $$

(3)

Here $\theta_{0}$ is the bias, $\theta_{0} , \theta_{1} , \ldots . \theta_{n}$ are the weight of the polynomial regression equation and $n$ is the polynomial degree. Since hourly solar radiation profile follows a polynomial path, this AI algorithm is modelled in this study for the forecast of solar irradiance in accordance with the literature ⁵⁵.

Multi-linear regression

This AI algorithm employs numerous explanatory factors to predict the result of the response variable. The objective of multiple linear regression (MLR) model is to describe the linear connection between the (independent) explanatory and the (dependent) responsive variables. The connection of many independent variables $\left( {x_{1} ,x_{2} ,x_{3} ,x_{4} } \right)$ and a dependent variable $\left( {\hat{y}} \right)$ is explored and the first order of regression function employed in this investigation is presumed to be;

$$ \hat{y} = b_{0} + b_{1} x_{1} + b_{2} x_{2} + b_{3} x_{3} + b_{4} x_{4} $$

(4)

where $b_{0}$ is the y-axis cut-off point for the adjusted regression curve, $b_{1}$ is the first variable of guess $x_{1}$, and $b_{2}$ is the first variable of guessing $x_{2}$. The independent variables; wind speed, temperature, humidity, and pressure ($x_{1}$, $x_{2}$, $x_{3} $ and $x_{4}$) and dependency variable ($\hat{y}$) solar radiation are correspondingly used as a in this study.

Decision tree regression

Decision trees are hierarchical non-parametric structures, which build both regression and classification models in a tree shape. A decision tree operates recursively and splits the original input space constantly into sub-sets to accumulate instances in smaller areas ⁵⁶. The decision-making tree is gradually created during the breaking process, and a final decision-making tree with leaf nodes is generated. A blade node shows a choice on a discreet or ongoing objective. The ID3 and C4.5 decision tree algorithms, invented by Ross Quinlan, are frequently utilized in literature ⁵⁷. A novel application of decision tree classifier in solar irradiance prediction was presented by Singh et al. ⁵⁸. In this work, the technique of the C4.5 decision tree regression is used because of the continuous nature of the sun irradiance values ⁵⁹. In the form of a model regression tree, a predictor space is divided into j regions $\left( {R_{1} , R_{2} ,R_{3} \ldots ..R_{J} } \right)$ is depicted as Fig. 2. For all instances in the same region, the same prediction is made by the means of answers (for all training examples in the region). The basic goal throughout the construction of a decision tree regression model is to locate regions $\left( {R_{1} , \ldots ..R_{J} } \right)$ which minimize the remaining square sum.

XG BOOST

eXtreme Gradient Boosting (XG-Boost or XGB) is one of the most recent machine learning algorithms that is very good for 1D dataset. In terms of precision and speed, it has the best performance for most tasks ⁶⁰. It runs in parallel and distributed computing, thereby achieving a higher learning rate in comparison with other set algorithms. XG-boost is a modified algorithm for generalized gradient boosting and it creates a distinct type of tree from the boost algorithm for gradients. The split may be found using a similarity score and gain in XG-boost. The regulating parameter is used to prevent the split from overfitting. When the parameter regularization is nil it falls into the standard technique for gradient boosting. Two more approaches avoid overfitting together with regularization. One is the retraction scales that change the weight by a factor η at each step. Its goal is to decrease an individual tree's effect on the model. The second method is to employ subsampling of columns, which similarly improves training time. Another essential step is that an approximation method is used to identify the optimum division ⁶¹.

Long short-term memory (LSTM)

For the resolution of the disappearing and exploding gradient problem, LSTM offers memory blocks instead of traditional recurrent neural network (RNN) units ⁶². It then adds a cell state to stored long-term states (Fig. 3) which is the main difference between LSTM and the vanilla RNN. An LSTM network can recall and link prior data to current data ⁶³. Three gates are integrated, including the input gate, "forgetful" gate, and output gate where $x_{t}$ references the current input; new and predecessor cell states are referred by $C_{t}$ and $C_{t - 1}$ , respectively; and $h_{t}$ and $h_{t - 1}$ respectively the current and preceding cell outputs. The LSTM input gate principle is expressed in the following forms:

$$ i_{t} = \sigma \left( {W_{i} *\left[ {h_{i - 1} , x_{t} } \right] + b_{i} } \right) $$

(5)

$$ \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t} = tanh\left( {W_{i} *\left[ {h_{i - 1} , x_{t} } \right] + b_{i} } \right) $$

(6)

$$ C_{t} = f_{t} C_{t - 1} + i_{t} \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t} $$

(7)

where Eq. (5) is utilized to employ a Sigmoid layer to pass $h_{i - 1}$ and $x_{t}$ to determine the required information. Then $h_{i - 1}$ and $x_{t}$ passing through the tanh layer in Eq. (6) is used to obtain fresh information. In Eq. (7) $W_{i}$ refers to a sigmoid output and $\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t}$ = a tanh output, the present moment information ($\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t - 1}$) and the LSTM Information ($\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t}$) is merged into $\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t}$ . Here, $W_{i}$ indicates weight matrices and $b_{i}$ is the LSTM gate bias.

The forgetful gate of the LSTM then permits selective information transmission through a sigmoid layer and a dot product. The choice of forgetting the associated information of an earlier cell with some likelihood, with $W_{f}$ referring to the weight matrix, b_f the offset and σ is the sigmoid function, is done using Eq. (8).

$$ f_{i} = \sigma \left( {W_{f} *\left[ {h_{t - 1} ,x_{t} } \right] + b_{f} } \right) $$

(8)

The output gate of the LSTM determines the state of the following inputs: $h_{t - 1}$ and $x_{t}$ in Eq. (9) and Eq. (13) respectively. The final result is acquired and multiplied through the vectors for state decisions which transmit through the tanh layer new information, C_t,

$$ O_{t} = \sigma \left( {W_{0} *\left[ {h_{t - 1} ,x_{t} } \right] + b_{0} } \right) $$

(9)

$$ h_{t} = O_{t} \tanh \left( {C_{t} } \right) $$

(10)

where $W_{0}$ and $b_{0}$ are the weighted matrices of the output gate and LSTM bias respectively.

Artificial neural network (ANN)

The ANN is an information processing model that imitates biological neural network activities and structures found in human brains ⁶⁴. This AI model is used to solve linear and nonlinear regression tasks. Figure 4 illustrates a basic neural network, with 2 input neurons, X and Y, 3 neurons, and 1 neuron. For the desired offset, the threshold component is utilized. The weights $w_{i,j}$ where the indexes of the neurons are $i and j$ are $a_{i} and b_{i}$. To compute the weighted amount, first X and Y are multiplied by their weights. The result is then added to a partial function and supplied into an activation. Every neuron computed in the hidden layer, $h_{j}$, is calculated with $h_{j} = s\left( {\mathop \sum \limits_{i} w_{i.j} *h_{i} } \right)$, where $S$ is the activation function. The ReLU Rectified Linear Unit (ReLU) function, $S\left( x \right) = {\text{max}}\left( {0, x} \right)$ is used for hidden layer activation and nonlinear activation while the Sigmoid function $S\left( x \right) = \frac{1}{1} + e^{ - x}$ is applied on the output layer to model the network’s probability distribution. ANN is one of the most predominant supervised learning AI algorithm for solar radiation forecast in literature ⁶⁵^–⁶⁷, hence, its adaptation to the dataset in this study.

Convolutional neural network (CNN)

This model is a special kind of multilayer perceptron, however, unlike other deep learning architecture, the basic neural network is unable to learn complicated characteristics. In several applications ⁶⁸, CNN algorithms have shown great performance in the categorization of images, object recognition, and analysis of medical images. However, it has also been used for solar irradiance prediction tasks in the existing works of literature ^69,70. The basic principle behind a CNN is that local features are obtained from high layer entrances and transferred for more complicated features to lower layers (as shown in Fig. 5). CNN converts the input data from the input layer into a collection of class scores for the output layer across all linked layers. A CNN includes the full connecting layers, the pooling, and the convolutional layers.

A collection of kernels ⁷¹ is used to determine the feature mappings tensor in the convolutional layer. These kernels converge a whole input with 'stride(s)' to make a volume in its dimensions ⁷². After the convolutional layer is employed for the processing, the dimensions of an input volume shrink. Therefore, zero-padding ⁷³ is necessary for padding input volumes with zeros and maintaining low-level dimensions of an input volume. The functioning of the convolutional layer is:

$$ F\left( {i, j} \right) = \left( {I*K} \right)\left( {i, y} \right) = \mathop \sum \nolimits \mathop \sum \nolimits I\left( {i + m,j + n} \right)K\left( {m,n} \right) $$

(11)

$I $ refers to an input matrix, $K$ is a 2D filter of size $m to n,$ and $F$ is a 2D feature map output. $I*K$ indicates the functioning of the convolutionary layer. The rectified linear unit (ReLU) layer is used to increase nonlinearity on feature maps ⁷⁴. By maintaining the threshold input at zero, ReLU calculates the activation. The following is expressed mathematically:

$$ f\left( x \right) = {\text{max}}\left( {0,x} \right) $$

(12)

Downsampling of a particular dimension is performed by the pooling layer ⁷⁵, in order to minimize parameters. The most frequent way of max-pooling in the input region generates the maximum value. The FC layer ⁷⁶ is utilized as a classifier that decides on the characteristics derived from the convolutions and pooling layers. A CNN aims to learn more about data by use of convolutions. For CNN predictive models it is necessary to collect data from convolutional layers while regression work is carried out in the last fully connected layer ⁷⁷. In this study, the Convolution-1D (Conv1D) which is most suitable for text input data is implemented to convolve the input data points over temporal or single spatial dimensional tensors.

Hybrid CNN-ANN architecture

The network CNN-ANN combines both networks with the extraction of functionalities. CNN uses kernel technology to upgrade filter weights to understand how the training data are represented. The model contains a single CNN layer with 5 * 2 * 2-stride filters that complement the input data. The model of CNN contains hidden neuronal layers depending on the model for a specific dataset. The output of the CNN layer is flattened so that the complimentary ANN model may be supplied. The ANN network also consists of hidden layers of neurons and a one-node output layer. Both models are formed to compute the relevant derivatives as a single end-to-end network with a loss function as a cross-entropy. Adam optimizer, a learning rate of 0.001, and a training lot size of 512 were used for different epochs. Figure 6 illustrates the architecture of the model. The neurons in this hybrid system can be summed up as a result of the secret layers.

Every layer in a 1-D convolutional neural network mathematically extracts patterns in $G_{i}$, as it pertains to other input variables using Eq. (13) ²¹.

$$ h_{{\ddot{y}}}^{k} = f\left( {\left( {W^{k} *x} \right)_{ij} + b_{k} } \right) $$

(13)

W^k is the kernel weight associated with the k^th feature map, f represents the activation feature, and * is the operator. Equation (13), where c is the output $h_{{\ddot{y}}}^{k}$., can be rewritten under Eq. (14).

$$ q = f\left( {\left( {W^{k} *x} \right)_{ij} + b_{k} } \right) $$

(14)

A flattened layer is utilized in the hybrid model to transform the matrix into a unique vector (Eq. (15)), so that the matrix may be adapted to the ANN model input.

$$ Z = f\left( q \right) $$

(15)

ANN model is used as input for the output of the flattened layer (Z) (Eq. (16)).

$$ y\left( x \right) = L\left( {\mathop \sum \limits_{j = 1}^{N} w_{j} \left( p \right).\,Z_{j} \left( p \right) + c} \right) $$

(16)

where $y\left( x \right)$ has been predicted $G_{i}$ is the weight which links neurons to the input layer $w_{j} \left( p \right)$, the variable $Z_{j} \left( p \right)$ is the discrete input variable $t$ and the neuronal bias $c$, of the input variable, $L\left( . \right)$ is the hidden transfer function.

Hybrid CNN-LSTM-ANN architecture

The threefold hybrid model has been created to compare the effectiveness of the model in extracting the data by complementing each other in order to understand short and long-term relationships. As shown in Fig. 7, a recurrent neural network is added for this hybrid model which is running in cycles and is extremely proficient in sequence analysis. The combined LSTM helps to maintain the required data from earlier concealed countries compared to the CNN-ANN model. The input data are supplied with neurons to the hidden layer(s) 1D CNN, and then sent to the LSTM network in hidden states and ultimately the densely linked network that generates the overall model forecast. For this hybrid, the ANN model consists of different layers of neurons depending on the data set. The architecture of CNN and ANN is similar to the hybrid CNN-ANN concept mentioned above. This model contains fundamental computations integrated with the synthesis of neurons in the hidden layers of a hybrid model. This is described in four different phases ⁷⁸.

Phase One: The LSTM model determines the information that is thrown away from the $f_{t}$ forgotten gate in Eq. (16), according to the hidden state h_t-1, and the new input q_t is modeled with Eq. (20).

$$ f_{t} = \sigma \left( {W_{f} \times \left[ {h_{t - 1} ,q_{t} } \right] + b_{f} } \right) $$

(17)

where W_f is the matrix weight, the logistic sigmoid function is $\sigma \left( \ldots \right)$ and the bias function is $b_{f}$.

Phase Two: The information stored in the cell state is chosen in this step. There is also a new cell candidate ($\tilde{C}_{t}$) created by the 'input gate' i_t is likewise scaled.

$$ \tilde{C}_{t} = tanh\left( {W_{C} \times \left[ {h_{t - 1} ,q_{t} } \right] + b_{C} } \right) $$

(18)

$$ i_{t} = \sigma \left( {W_{i} \times \left[ {h_{t - 1} ,q_{t} } \right] + b_{i} } \right) $$

(19)

The hyperbolic tangent function in Eq. (18) is Tanh (…).

Phase Three: A combination of the earlier cell state C_t-1 and $\tilde{C}_{t}$. will update the new cell C_t. $f_{t}$ is affected and is also scalable by i_t. in the previous cell.

$$ C_{t} = f_{t} *C_{t - 1} + i_{t} *\tilde{C}_{t} $$

(20)

Phase Four: The final step is to divide the output into two stages and define the resulting cell state by creating an $o_{t}$"output gate." The tanh function triggered C_t is filtered by $o_{t}$. The outcome is the desired output $h_{t}$

$$ o_{t} = \sigma \left( {W_{o} \times \left[ {h_{t - 1} ,q_{t} } \right] + b_{o} } \right) $$

(21)

$$ h_{t} = o_{t} \times tanh\left( {C_{t} } \right) $$

(22)

The flattening layer transforms the matrix (Eq. (22)) into a single vector for this hybrid model.

$$ Z = f\left( {h_{t} } \right) $$

(23)

ANN model is used as input for the output of the flattened layer (Z) (Eq. (16)).

Data Acquisition and Preparation

The solar radiation dataset for this research is collected from three different databases namely; TMY⁷⁹, SARAH ⁸⁰, and WB-ESMAP⁸¹. These datasets have been measured for different and nine various specific locations within these countries. The specifics (including longitude, elevation, and latitude) of the locations from which these datasets were measured are summarized in Table 2. Since various solar irradiance types are considered in this study, the data timestep for the datasets also varies.

Table 2 Location details of research dataset.

Full size table

Training and testing of the models

The proposed and compared artificial intelligence (AI) models can be trained using different data sizes. While the hourly solar radiation prediction based on TMY considers 12 years of hourly data, 34 years of data is used for daily solar irradiance prediction. For the WB-ESMAP data which considers the prediction of solar irradiance with the timestep being minutes, 2 years of data were used for training/testing and the dataset summary is presented in Table 3. Also, for all the case studies, 90% of the data are used for training while the remaining 10% are the test dataset. The countries considered for the GSR task include Algeria, the Central African Republic (CAR), South Africa (SA), and Egypt. While Nigeria is considered for the daily average DNI task and hourly DSR task, Senegal is the only country considered for DHI_RSI, GHI_Sil, and GHI_pyr tasks (Table 2).

Table 3 Data training and test set summary.

Full size table

Since the dataset varies based on the database it was extracted from, the input layers of the dataset also differ. For the datasets from all the databases, three input nodes namely year, month, and day are constant. All the AI models designed for the TMY dataset use an input layer of 7 nodes and these nodes represent the input parameters. In addition to the 3 constant nodes for all the datasets, the other TMY input nodes are hour, ambient temperature, wind speed, and sun elevation. Also, the input layer of the models designed for solar irradiance prediction with the SARAH dataset has (1 node in addition to the aforementioned 3 nodes) a total of 4 nodes. The additional node is the daily sunshine duration. Furthermore, the AI models based on the WB-ESMAP dataset consider an input layer with 10 nodes. These nodes (input parameters) are wind speed, wind direction, precipitation, wind speed, air temperature, relative humidity, barometric pressure, and the other constant 3 nodes (Table 3).

Model implementation and evaluation metrics

Since these AI models are designed for African (developing) countries, the selection of the number of hidden layers and their corresponding neurons were strategically optimized to ensure fast computation, and optimal convergence, and to avoid model over-fitting. All the AI regression models have been built using the Tensorflow and Keras Application Programming Interface (API) and the mean square error (MSE) in Eq. (24) has been adopted as the loss function while (ReLU) is used as the (nonlinear) activation function. For the deep learning models, the feedforward computation is completed, resulting in the model’s predicted value. This value is compared to the ground truth value or label and the loss is computed. Backpropagation is employed to find the derivative of the model parameters and the cost function is minimized using the “Adam” optimizer. All the AI models were implemented in a Python environment (via Jupyter notebook) which runs with a Core i7, 2.20 GHz system with 16 GB RAM, and GTX1060 6 GB Graphics card.

To have the same basis for comparison, the three most common evaluation metrics for numerical AI tasks are adopted in this study to evaluate the performance of all the models. These include root mean square error (RMSE), mean absolute error (MAE), and correlation coefficient (r). These metrics were chosen based on their adoption in (solar radiation prediction) existing works of literature (in developing countries) ^6,30,82. The mathematical models of the following metrics are:

$$ MSE = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left( {G_{i}^{m} - G_{i}^{p} } \right)^{2} $$

(24)

$$ r = \left( {\frac{{\mathop \sum \nolimits_{i = 1}^{N} \left( {\left( {G_{i}^{m} - \left\langle {G_{i}^{m} } \right\rangle } \right)\left( {G_{i}^{p} - \left\langle {G_{i}^{p} } \right\rangle } \right)} \right)}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{N} \left( {G_{i}^{m} - \left\langle {G_{i}^{m} } \right\rangle } \right)^{2} } \sqrt {\mathop \sum \nolimits_{i = 1}^{N} \left( {G_{i}^{p} - \left\langle {G_{i}^{p} } \right\rangle } \right)^{2} } }}} \right)^{2} $$

(25)

$$ MAE = \frac{{\mathop \sum \nolimits_{N}^{t = 1} \left| {G_{i}^{m} - G_{i}^{p} } \right|}}{N} $$

(26)

$$ RMSE = \sqrt {\frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left( {G_{i}^{m} - G_{i}^{p} } \right)^{2} } $$

(27)

where $G_{i}^{m}$ is the measured value and $G_{i}^{p}$ represents the predicted value, and $ <{G_{i}^{m} } >$/$< {G_{i}^{p} }>$ are the average values of $G_{i}^{m}$ and $G_{i}^{p}$ respectively.

Results

In this study, the performance of 10 different artificial intelligence models has been compared for various solar irradiance prediction tasks in some selected developing (African countries). While most studies in existing literature have only focused on the hourly forecast of various solar radiation parameters, this study furthers the knowledge in literature by considering different timesteps namely minutes, hourly, and daily. Various solar irradiance parameters (from different measurement techniques) were also considered to highlight the intrinsic attention to detail of the AI models. Considering the technological developmental status of these countries, the models were built to be as simple as possible. In this section performance of all the AI models is discussed. The discussion is presented in three subsections following the timesteps of the solar irradiance parameters.

Daily average direct normal irradiance prediction

The average daily solar irradiance prediction task considers two locations (namely Akure and Abuja) in Nigeria. Also, the specific solar parameter considered is direct normal solar irradiance (DNI) and this is integral to the performance/ development of many solar-based technologies. The number of hidden layers (as well as the number of neurons in each hidden layer) in each AI model is summarized in Table 4. Also, the optimal number of training epochs and training batch size for each of the models are presented in the same table. This highlights the simplicity of these models and their adaptability to the targeted developing countries.

Table 4 Optimal AI training parameters for daily DNI task.

Full size table

Furthermore, the performance of all the models based on the three evaluation metrics used in this study is tabulated in Table 5. Specifically, for Abuja_DNI prediction, two models (DTR and MLR) were found unsuitable for this AI task. This is due to the high RMSE and MAE as well as the low r-value (Table 5). In this study, the models were tasked to forecast the daily average DNI for 3.4 years and the forecasted results in comparison to the real data are compared in Fig. 8a. However, a more detailed pictorial representation (in Fig. 8b) of the forecasted result showed the inadequacies of MLR and DTR. While the performances of ANN, CNN-ANN, and LSTM are quite similar, the most suitable AI models for the Abuja_DNI prediction tasks are CNN-LSTM-ANN and XGB. However, XGB is preferable due to its unsupervised learning characteristics and its fast computational time when compared with CNN-LSTM-ANN.

Table 5 Daily DNI task evaluation metric summary.

Full size table

It is also noteworthy that XGB has the least MAE and RMSE (40.78282 W/m² and 53.73310 W/m² respectively) as well as the least r-value (0.800087) as highlighted in Table 5. The new hybrid deep learning CNN-LSTM-ANN model presented in this study is a viable alternative to XGB as the performance of this model differs slightly. While the CNN-LSTM-ANN r-value is 0.79643, the RMSE and MAE are 41.48851 W/m² and 24.68782 W/m² respectively. The close proximity of this model results (forecasted DNIs) to that of the real data in Fig. 8b further highlights its potency.

The AI models’ performance for the same task considering another location (Akure_DNI) has a similar pattern to its corresponding Abuja_DNI AI models. Although the only AI model that seems unsuitable for this task is DTR, its performance based on the evaluation metrics is still higher when compared to the Abuja_DNI task (Table 5). The difference in model performance between Abuja_DNI and Akure_DNI prediction tasks can be attributed to the solar distribution in these locations. Akure as a location has a more distributed daily average DNI when compared with Abuja (as seen in Fig. 9a as compared to Fig. 8a), hence the high predictive performance by all the AI models.

While all the models (with the exception of DTR) recorded a good performance for the Akure_DNI prediction task, the best models for this particular task are ANN and XGB. The r-value, RMSE and MAE for these models respectively are 0.948073, 25.14591 W/m², and 19.10983 W/m² for ANN; 0.949997, 24.68782 W/m², and 18.52771 W/m² for XGB. The supervised learning feature of ANN creates room for further improvement of the model (especially when applied in other locations), however, the ANN model overfitting problem should be avoided. As seen in Fig. 9b, the forecasted Akure_DNI with XGB has the closest proximity to the real data. Therefore, it can be inferred that XGB models are most suitable for DNI daily average DNI forecasting.

Hourly solar radiation forecast

The hourly solar radiation prediction task in this study considers both diffused solar radiation (DSR) and global solar radiation (GSR). The AI models developed for this prediction task are adapted to five locations across Algeria, Nigeria, CAR, Egypt, and South Africa (Table 2). Due to the variation in location, the training parameters for the deep (supervised) learning AI models are optimized to achieve the best predictive performance in each location. Hence, the optimal batch size, number of epochs, number of hidden layers as well as the number of neurons in each hidden layer for all the deep learning models used are highlighted in Table 6.

Table 6 Optimal AI training parameters for hourly SR task.

Full size table

Out of all the 10 AI models presented in this study, six models have a very good predictive performance on the evaluation metrics results (Table 7). These models are ANN, CNN-ANN, CNN-LSTM-ANN, CNN, PLR, and XGB. The predictive output data (results) in comparison to the real data for all the models over the total test period (for all the location that considers hourly solar radiation forecast) is illustrated (in Fig. A) in the appendix section of this study. From the results of this study, it can also be deduced that the MLR model is not suitable for this specific task (Fig. 10a).

Table 7 Hourly SR task evaluation metric summary.

Full size table

The hybrid CNN-LSTM-ANN AI model proposed in this study recorded the best predictive performance for the Algeria_GSR task with an r-value, RMSE, and MAE of 0.977527, 81.101 W/m², and 30.8785 W/m². However, the close proximity of ANN, XGB, and CNN-ANN are evident in their predictive performance over a period of 72 h (Fig. 10a). The performance of the models presented in this study further strengthens existing works of literature in this field as the accuracies are higher than some of the reported results in literature.

Unlike Algeria, the hourly solar radiation prediction task for the location in Nigeria considers diffused solar radiation (DSR). While the r-values of the AI models developed for this task are comparatively smaller than that of the GSR task for other countries, the RMSE and MAE are also smaller. This is due to the statistical and meteorological distribution (as seen in Fig. 10b) of DSR when compared with GSR.

It is also noteworthy that most of the existing works of literature in the domain of solar radiation prediction worked on GSR hourly prediction. Therefore, this study further contributes to the literature as these AI models have been optimized for DSR prediction. While six AI models had high predictive performance when used for the Nigeria_DSR task, XGB is the most superior of all the models. As highlighted in Table 7, the RMSE, MAE, and r-value for the XGB model, when used for the Nigeria_DSR task, are 49.1553 W/m², 17.0214 W/m², and 0.904992. The predicted data for all the AI models are compared with the real data over a period of 72 h and highlighted in Fig. 10b.

The other three countries considered for the solar radiation task in this study are CAR, Egypt, and South Africa. The AI models were developed for GSR hourly prediction tasks in this study and the performance of each of these models is highlighted in Table 7. The models that are suitable for the CAR_GSR task are ANN, CNN-ANN, XGB, and PLR. Considering the evaluation metrics (r = 0.965303, MAE = 45.5573 W/m², RMSE = 95.9444 W/m² in Table 7) and the predictive output data plotted in Fig. 10c, ANN is the most suitable AI model for CAR_GSR forecast task.

It is noteworthy that the high MAE and RMSE values reported in this study for hourly solar radiation are due to the GSR unit. While the unit of GSR in this study is W/m², in most literatures, kW/m² is the unit adopted for GSR, hence the lower MAE and RMSE reported in these studies.

The performance of the AI models for the Egypt_GSR prediction task is the best in this entire study and this is due to the high solar intensity and good solar radiation distribution in the location chosen for this country. As seen in Fig. 10d. and Table 7, the most accurate model for GSR prediction in this location is the proposed CNN-LSTM-ANN model in this study. The r-value, RMSE, and MAE of the model are 0.987936, 60.49804 W/m², and 22.31752 W/m² respectively and these are the best evaluation metrics considering all the AI models for this particular location. Although the performance of XGB is quite similar to the CNN-LSTM-ANN model, the supervised learning nature of the model resulted in a better performance when compared to the XGB model. It is also worth noting that all the deep (supervised) learning models in this study have the capacity to give an accurate prediction of hourly solar radiation.

The last location considered for the GSR prediction (in a developing country context) is in South Africa. The performance (considering the r-value) of all the models (except DTR) is very similar for this location. However, as illustrated in Fig. 10e, the GSR forecast using the XGB model is the closest to the real data. This model had the least RMSE and MAE (91.15934 W/m² and 32.59973 W/m² respectively) as well as the highest r-value (0.968881) as highlighted in Table 7. The locations selected for the hourly solar radiation tasks in this study have been chosen considering data availability and good solar radiation potential. The fast computation speed for all the AI models in this study based on the models’ parameters further showcases their potency in application.

Solar irradiance prediction based on minutes timestep

One of the outstanding contributions of this present study is the development of AI models to forecast solar irradiance based on minutes timestep. Existing works of literature have majorly focused on the hourly solar irradiance prediction, however, the knowledge of solar irradiance minute by minute will further enhance the estimation of energy production from solar-based technology. Two locations in Senegal have been considered and three different measurement techniques for each location. The optimized training parameters for the deep learning models applied for each task are summarized in Table 8.

Table 8 Optimal AI training parameters for minute-ahead solar irradiance task.

Full size table

One of the things noticed for the preliminary training of all the datasets in this category with the AI models is that the PLR cannot perform this prediction task. Therefore, nine AI models are considered in this section for the solar irradiance prediction task. Generally, the predictive performance of the models (based on the evaluation metrics) shows that it is more difficult for the AI models to accurately forecast solar irradiance minute-by-minute when compared with its corresponding hourly or daily AI models. The nine AI models were tested by using it to forecast the diffused and global horizontal irradiance (DHI_RSI, GHI_pyr, and GHI_Sil) for 39 days in the two locations in Senegal. The forecasted results for Senegal_Toubal are plotted against the actual data and illustrated (in Fig. B) in the Appendix section. However, a day-ahead forecast is also conducted for Senegal_Toubal with the AI models and the results are illustrated in Fig. 11a and b.

Unlike other solar parameters prediction tasks or scenarios in this study (where various models are most suitable for different locations/solar parameters), the training/testing of the solar irradiance in this section showed that the XGB model is the most suitable in all the locations. As seen in Table 8, the AI models have a better performance for DHI_RSI and GHI_pyr in Senegal_Touba when compared to Senegal_Fatick. While the XGB model performance for DHI_RSI forecast task in Sengeal_Touba are r = 0.778685, RMSE = 104.911 W/m², and MAE = 69.41538 W/m², the corresponding best model (XGB) for Senegal_Fatick location are r = 0.727731, RMSE = 118.5533 W/m², and MAE = 82.44148 W/m² (Table 9). As seen in Fig. 11a, while the CNN-LSTM-ANN, LSTM, and ANN models can learn the data part, the proximity of the forecasted data based on the XGB model is better for most of the minutes in the day-ahead task. The plotted results in Fig. 11b and c further confirm the superiority of the XGB model as it follows the real data pattern.

Table 9 Minutes timestep SR task evaluation metric summary.

Full size table

Brief summary and discussion

Ten AI models have been used as the basis for developing specific algorithms to forecast solar irradiance parameters in this study. Considering the under-development and economic status of many developing countries, the AI models in this study have been adapted for this solar radiation forecast task in six developing (African) countries. It is worth noting that the applicability and the usefulness of the models are beyond developing countries. While two locations in Nigeria were considered for the daily average DNI task, another location in the same country is considered for the hourly average DSR estimation task. Similarly, two locations in Senegal were considered for the estimation of solar irradiance (DHI_RSI, GHI_pyr, and GHI_Sil) estimation task based on minutes timestep. Also, four locations in different countries have been used for GSR estimation. In summary, a total of 13 solar irradiance estimation tasks were carried out in this study considering 10 AI models for each task.

With the aim to check if there is a universal model for solar parameter estimation in developing countries, the results of this study show that various AI models are suitable for different solar irradiance estimations. However, the deep learning models (ANN, LSTM, and CNN), the hybrid deep learning models (CNN-ANN, and CNN-LSTM-ANN) as well as the XGB model has better predictive performance when compared to other models in most location. The results for the prediction of solar irradiance in minutes showed that XGB is the best model for this task in all the locations considered. Also, despite the change in solar measurement parameters in minutes timestep, the performance of the XGB model was relatively suitable for the task. It is, however, noteworthy that the AI models had the least predictive accuracy when considering the minutes' timesteps.

Similarly, the XGB model is the most suitable model for daily average DNI estimation. While PLR and CNN-LSTM-ANN models had a comparatively good performance for this task, the prediction errors recorded by the XGB models are significantly lower. The daily average DNI estimation further shows the novelty of this study as the performance of the models for the Nigeria_Akure_DNI task is better in comparison to existing works of literature. The evaluation metrics for this specific task are r = 0.949997, RMSE = 24.68782, and MAE = 18.52771.

Deep learning models and XGB models are most suited for the hourly solar radiation task. While the innovative hybrid deep learning model (CNN-LSTM-ANN) proposed in this study is most suitable for GSR prediction in Northern African countries, the XGB model reported the best performance for Nigeria and South Africa. Also, the hourly solar radiation estimation accuracy is very high, hence it dominant in existing solar radiation research.

From this study, it can also be deduced that some AI models are not applicable for some specific solar irradiance tasks. PLR model could not learn any of the minute timestep tasks while DTR models also had a bad predictive performance for daily average DNI task. Therefore, these models can be excluded from these specific tasks in the future as they are machine (unsupervised) learning algorithms.

Conclusions

Based on the results of this study, all the models presented in this study showed their suitability for various solar irradiance prediction tasks. However, the XGB model can be concluded as the best model for solar irradiance prediction tasks out of all the developed AI algorithms considered that was considered within the scope of this research. This is due to its consistently high performance in all the tasks in the study. Despite the change in location and solar parameters, the XGB model had a relatively high performance/accuracy for all the tasks. While the results of the models in the study are better than some existing works of literature, the accuracy of the forecasted solar irradiance shows that more researches on the use of other AI models (such as reinforcement learning models and the developments of new hybrid AI models) are required.

In the future, more research will focus on the accurate prediction of solar irradiance considering the minutes' timestep. While this is the first study to present this (to the best knowledge of the authors), the estimation of solar irradiance in minutes will further help in forecasting solar technology’s production accurately. Thereby, improving the overall development of the solar energy sector.

Data availability

The datasets generated and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

$\hat{Y}$ :: Random forest algorithm's predictive value
$n, \,\,N, \,\,and\,\, T_{n} \left( x \right).$ :: Random forest algorithm's predictive mean values
N:: Maximum number of samples
$X$ :: Number of random forest decision trees in N
Σ:: Sigma
$Y$ :: The goal of the polynomial regression
$x$ :: Polynomial regression predictor
$\theta_{0}$ :: Polynomial regression bias
$\theta_{0} , \theta_{1} , \ldots . \theta_{n}$ :: The weight of the equation of regression
$n$ :: Polynomial degree
$\left( {x_{1} ,x_{2} ,x_{3} ,x_{4} } \right)$ :: Multiple linear regression independent variables
$\left( {\hat{y}} \right)$ :: Multiple linear regression dependent variable
$b_{0}$ :: Multiple linear regression Y-axis cut-off point for the adjusted regression curve
$b_{1} ,b_{2} ,b_{3} ,b_{4}$ :: Multiple linear regression first variable of guessing $x_{1} ,\,\,x_{2} ,\,\,x_{3} ,\,\,x_{4}$
$\left( {R_{1} , R_{2} ,R_{3} \ldots ..R_{J} } \right)$ :: Decision tree regression predictor space jth regions
$x_{t}$ :: Long short-term memory current input
$C_{t}$ :: Long short-term memory new cell states
$C_{t - 1}$ :: Long short-term memory predecessor cell states
$h_{t}$ :: Long short-term memory current cell outputs
$h_{t - 1}$ :: Long short-term memory preceding cell outputs
$W_{i}$ :: Long short-term memory sigmoid output
$\left( {\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{C}_{t} } \right)$ :: Long short-term memory information
$b_{i}$ :: Long short-term memory gate bias
$W_{f}$ :: Long short-term memory weight matrix
$W_{0} \,\,{\text{and}}\,\,b_{0}$ :: Long short-term memory weighted matrices of the output gate and LSTM bias respectively
$i and j,\,\,\,a_{i} and b_{i}$ :: Indexes of the artificial neural network neurons
X and Y:: Artificial neural network input neurons
$h_{j}$ :: Artificial neural network hidden layer
$S$ :: Artificial neural network activation function
$I$ :: Convolutional neural network input matrix
$K$ :: Convolutional neural network 2D filter of size $m to n$
$F$ :: Convolutional neural network 2D feature map output
$I*K$ :: Indicates the functioning of the convolutionary layer
W^k :: CNN-ANN kernel weight associated with the Kth feature map
f :: CNN-ANN activation feature
*:: CNN-ANN operator
c:: CNN-ANN output
$G_{i}$ :: CNN-ANN weight which links neurons to the input layer $w_{j} \left( p \right)$
$Z_{j} \left( p \right)$ :: CNN-ANN discrete input variable $t$ and the neuronal bias $c$, of the input variable
$y\left( x \right)$ :: CNN-ANN prediction
$L\left( . \right)$ :: CNN-ANN hidden transfer function
$f_{t}$ :: CNN-LSTM-ann forgotten gate
h_t-1 :: CNN-LSTM-ANN hidden state
q_t :: CNN-LSTM-ANN new input
$\sigma \left( \ldots \right)$ :: CNN-LSTM-ANN logistic sigmoid function
$b_{f}$ :: CNN-LSTM-ANN bias function
$\left( {\tilde{C}_{t} } \right)$ :: CNN-LSTM-ANN new cell candidate
i _t :: CNN-LSTM-ANN scaling factor
$o_{t}$ :: CNN-LSTM-ANN output gate
$h_{t}$ :: CNN-LSTM-ANN desired output

References

Guijo-Rubio, D. et al. Evolutionary artificial neural networks for accurate solar radiation prediction. Energy https://doi.org/10.1016/j.energy.2020.118374 (2020).
Article Google Scholar
Solangi, K. H., Islam, M. R., Saidur, R., Rahim, N. A. & Fayaz, H. A review on global solar energy policy. Renew. Sustain. Energy Rev. https://doi.org/10.1016/j.rser.2011.01.007 (2011).
Article Google Scholar
Sarkodie, S. A., Adams, S. & Leirvik, T. Foreign direct investment and renewable energy in climate change mitigation: Does governance matter?. J. Clean. Prod. https://doi.org/10.1016/j.jclepro.2020.121262 (2020).
Article Google Scholar
International Energy Agency, Key world energy statistics 2018 energy statistics, Report (2018).
Ghimire, S., Deo, R. C., Raj, N. & Mi, J. Wavelet-based 3-phase hybrid SVR model trained with satellite-derived predictors, particle swarm optimization and maximum overlap discrete wavelet transform for solar radiation prediction. Renew. Sustain. Energy Rev. 113, 2019. https://doi.org/10.1016/j.rser.2019.109247 (2019).
Article Google Scholar
Govindasamy, T. R. & Chetty, N. Machine learning models to quantify the influence of PM10 aerosol concentration on global solar radiation prediction in South Africa. Clean. Eng. Technol. 2, 100042. https://doi.org/10.1016/j.clet.2021.100042 (2021).
Article Google Scholar
Abedinia, O., Zareinejad, M., Doranehgard, M. H., Fathi, G. & Ghadimi, N. Optimal offering and bidding strategies of renewable energy based large consumer using a novel hybrid robust-stochastic approach. J. Clean. Prod. 215, 878–889. https://doi.org/10.1016/j.jclepro.2019.01.085 (2019).
Article Google Scholar
Dong, J. et al. Novel stochastic methods to predict short-term solar radiation and photovoltaic power. Renew. Energy 145, 333–346. https://doi.org/10.1016/j.renene.2019.05.073 (2020).
Article Google Scholar
Zhou, Y., Liu, Y., Wang, D., Liu, X. & Wang, Y. A review on global solar radiation prediction with machine learning models in a comprehensive perspective. Energy Convers. Manag. https://doi.org/10.1016/j.enconman.2021.113960 (2021).
Article Google Scholar
Rigollier, C., Lefèvre, M. & Wald, L. The method Heliosat-2 for deriving shortwave solar radiation from satellite images. Sol. Energy 77(2), 159–169. https://doi.org/10.1016/j.solener.2004.04.017 (2004).
Article ADS Google Scholar
Jiang, Y. Computation of monthly mean daily global solar radiation in China using artificial neural networks and comparison with other empirical models. Energy 34(9), 1276–1283. https://doi.org/10.1016/j.energy.2009.05.009 (2009).
Article Google Scholar
Shadab, A., Said, S. & Ahmad, S. Box–Jenkins multiplicative ARIMA modeling for prediction of solar radiation: a case study. Int. J. Energy Water Resour. 3(4), 305–318. https://doi.org/10.1007/s42108-019-00037-5 (2019).
Article Google Scholar
Hai, T. et al. Global solar radiation estimation and climatic variability analysis using extreme learning machine based predictive model. IEEE Access 8, 12026–12042. https://doi.org/10.1109/ACCESS.2020.2965303 (2020).
Article Google Scholar
Rodríguez-Benítez, F. J. et al. A short-term solar radiation forecasting system for the Iberian Peninsula. Part 1: models description and performance assessment. Sol. Energy 195, 396–412. https://doi.org/10.1016/j.solener.2019.11.028 (2020).
Article ADS Google Scholar
Gürel, A. E., Ağbulut, Ü. & Biçen, Y. Assessment of machine learning, time series, response surface methodology and empirical models in prediction of global solar radiation. J. Clean. Prod. https://doi.org/10.1016/j.jclepro.2020.122353 (2020).
Article Google Scholar
Sun, S., Wang, S., Zhang, G. & Zheng, J. A decomposition-clustering-ensemble learning approach for solar radiation forecasting. Sol. Energy 163, 189–199. https://doi.org/10.1016/j.solener.2018.02.006 (2018).
Article ADS Google Scholar
Belmahdi, B., Louzazni, M. & El Bouardi, A. One month-ahead forecasting of mean daily global solar radiation using time series models. Optik (Stuttg). 219, 165207. https://doi.org/10.1016/j.ijleo.2020.165207 (2020).
Article ADS Google Scholar
Blal, M. et al. A prediction models for estimating global solar radiation and evaluation meteorological effect on solar radiation potential under several weather conditions at the surface of Adrar environment. Meas. J. Int. Meas. Confed. 152, 107348. https://doi.org/10.1016/j.measurement.2019.107348 (2020).
Article Google Scholar
Heng, J., Wang, J., Xiao, L. & Lu, H. Research and application of a combined model based on frequent pattern growth algorithm and multi-objective optimization for solar radiation forecasting. Appl. Energy 208, 845–866. https://doi.org/10.1016/j.apenergy.2017.09.063 (2017).
Article Google Scholar
Kisi, O., Heddam, S. & Yaseen, Z. M. The implementation of univariable scheme-based air temperature for solar radiation prediction: New development of dynamic evolving neural-fuzzy inference system model. Appl. Energy 241, 184–195. https://doi.org/10.1016/j.apenergy.2019.03.089 (2019).
Article Google Scholar
Ghimire, S., Deo, R. C., Raj, N. & Mi, J. Deep solar radiation forecasting with convolutional neural network and long short-term memory network algorithms. Appl. Energy 253, 113541. https://doi.org/10.1016/j.apenergy.2019.113541 (2019).
Article Google Scholar
Rodríguez-Benítez, F. J. et al. Assessment of new solar radiation nowcasting methods based on sky-camera and satellite imagery. Appl. Energy 292, 116838. https://doi.org/10.1016/j.apenergy.2021.116838 (2021).
Article Google Scholar
Peng, T., Zhang, C., Zhou, J. & Nazir, M. S. An integrated framework of Bi-directional long-short term memory (BiLSTM) based on sine cosine algorithm for hourly solar radiation forecasting. Energy 221, 119887. https://doi.org/10.1016/j.energy.2021.119887 (2021).
Article Google Scholar
del Campo-Ávila, J., Takilalte, A., Bifet, A. & Mora-López, L. Binding data mining and expert knowledge for one-day-ahead prediction of hourly global solar radiation. Expert Syst. Appl. 167, 114147. https://doi.org/10.1016/j.eswa.2020.114147 (2021).
Article Google Scholar
Lai, C. S., Zhong, C., Pan, K., Ng, W. W. Y. & Lai, L. L. A deep learning based hybrid method for hourly solar radiation forecasting. Expert Syst. Appl. https://doi.org/10.1016/j.eswa.2021.114941 (2021).
Article Google Scholar
Guermoui, M., Melgani, F. & Danilo, C. Multi-step ahead forecasting of daily global and direct solar radiation: a review and case study of Ghardaia region. J. Clean. Prod. 201, 716–734. https://doi.org/10.1016/j.jclepro.2018.08.006 (2018).
Article Google Scholar
Zhou, Y. et al. A novel combined multi-task learning and Gaussian process regression model for the prediction of multi-timescale and multi-component of solar radiation. J. Clean. Prod. 284, 124710. https://doi.org/10.1016/j.jclepro.2020.124710 (2021).
Article Google Scholar
Makade, R. G., Chakrabarti, S. & Jamil, B. Development of global solar radiation models: a comprehensive review and statistical analysis for Indian regions. J. Clean. Prod. 293, 126208. https://doi.org/10.1016/j.jclepro.2021.126208 (2021).
Article Google Scholar
Prasad, R., Ali, M., Xiang, Y. & Khan, H. A double decomposition-based modelling approach to forecast weekly solar radiation. Renew. Energy 152, 9–22. https://doi.org/10.1016/j.renene.2020.01.005 (2020).
Article Google Scholar
Pang, Z., Niu, F. & O’Neill, Z. Solar radiation prediction using recurrent neural network and artificial neural network: a case study with comparisons. Renew. Energy 156, 279–289. https://doi.org/10.1016/j.renene.2020.04.042 (2020).
Article Google Scholar
Puah, B. K. et al. A regression unsupervised incremental learning algorithm for solar irradiance prediction. Renew. Energy 164, 908–925. https://doi.org/10.1016/j.renene.2020.09.080 (2021).
Article Google Scholar
Narvaez, G., Giraldo, L. F., Bressan, M. & Pantoja, A. Machine learning for site-adaptation and solar radiation forecasting. Renew. Energy 167, 333–342. https://doi.org/10.1016/j.renene.2020.11.089 (2021).
Article Google Scholar
Karaman, Ö. A., Ağır, T. T. & Arsel, İ. Estimation of solar radiation using modern methods. Alexandria Eng. J. 60(2), 2447–2455. https://doi.org/10.1016/j.aej.2020.12.048 (2021).
Article Google Scholar
Ağbulut, Ü., Gürel, A. E. & Biçen, Y. Prediction of daily global solar radiation using different machine learning algorithms: Evaluation and comparison. Renew. Sustain. Energy Rev. 135(March), 2021. https://doi.org/10.1016/j.rser.2020.110114 (2020).
Article Google Scholar
Al-Rousan, N., Al-Najjar, H. & Alomari, O. Assessment of predicting hourly global solar radiation in Jordan based on Rules, Trees, Meta, Lazy and Function prediction methods. Sustain. Energy Technol. Assessments 44, 100923. https://doi.org/10.1016/j.seta.2020.100923 (2021).
Article Google Scholar
Das, S. Short term forecasting of solar radiation and power output of 89.6kWp solar PV power plant. Mater. Today Proc. 39, 1959–1969. https://doi.org/10.1016/j.matpr.2020.08.449 (2019).
Article Google Scholar
Bounoua, Z., Chahidi, L. O. & Mechaqrane, A. Estimation of daily global solar radiation using empirical and machine-learning methods: A case study of five Moroccan locations. Sustain. Mater. Technol. 28, e00261. https://doi.org/10.1016/j.susmat.2021.e00261 (2021).
Article Google Scholar
Shadab, A., Ahmad, S. & Said, S. Spatial forecasting of solar radiation using ARIMA model. Remote Sens. Appl. Soc. Environ. 20, 100427. https://doi.org/10.1016/j.rsase.2020.100427 (2020).
Article Google Scholar
Srivastava, R., Tiwari, A. N. & Giri, V. K. Solar radiation forecasting using MARS, CART, M5, and random forest model: a case study for India. Heliyon 5(10), e02692. https://doi.org/10.1016/j.heliyon.2019.e02692 (2019).
Article PubMed PubMed Central Google Scholar
Sharafati, A. et al. The potential of novel data mining models for global solar radiation prediction. Int. J. Environ. Sci. Technol. 16(11), 7147–7164. https://doi.org/10.1007/s13762-019-02344-0 (2019).
Article Google Scholar
Tao, H. et al. Global solar radiation prediction over North Dakota using air temperature: development of novel hybrid intelligence model. Energy Rep. 7, 136–157. https://doi.org/10.1016/j.egyr.2020.11.033 (2021).
Article Google Scholar
Bamisile, O. et al. Comparison of machine learning and deep learning algorithms for hourly global/diffuse solar radiation predictions. Int. J. Energy Res. https://doi.org/10.1002/er.6529 (2021).
Article Google Scholar
TSMS, “Turkish State Meteorological Service,” 2020. https://mgm.gov.tr/eng/forecast-cities.aspx (accessed Jan. 07, 2020).
Zang, H., Xu, Q. & Bian, H. Generation of typical solar radiation data for different climates of China. Energy 38(1), 236–248. https://doi.org/10.1016/j.energy.2011.12.008 (2012).
Article Google Scholar
Wang, H., Lei, Z., Zhang, X., Zhou, B. & Peng, J. A review of deep learning for renewable energy forecasting. Energy Conv. Manage. https://doi.org/10.1016/j.enconman.2019.111799 (2019).
Article Google Scholar
Ahmed, R., Sreeram, V., Mishra, Y. & Arif, M. D. A review and evaluation of the state-of-the-art in PV solar power forecasting: techniques and optimization. Renew. Sustain. Energy Rev. https://doi.org/10.1016/j.rser.2020.109792 (2020).
Article Google Scholar
Ahmad, T., Zhang, H. & Yan, B. A review on renewable energy and electricity requirement forecasting models for smart grid and buildings. Sustain. Cities Soc. https://doi.org/10.1016/j.scs.2020.102052 (2020).
Article PubMed PubMed Central Google Scholar
Liu, H., Mi, X. & Li, Y. Smart deep learning based wind speed prediction model using wavelet packet decomposition, convolutional neural network and convolutional long short term memory network. Energy Convers. Manag. 166, 120–131. https://doi.org/10.1016/j.enconman.2018.04.021 (2018).
Article Google Scholar
Ren, S., Cao, X., Wei, Y. & Sun, J. Global refinement of random forest. in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 07–12-June, 723–730, https://doi.org/10.1109/CVPR.2015.7298672 (2015).
Biau, G. & Scornet, E. A random forest guided tour. TEST https://doi.org/10.1007/s11749-016-0481-7 (2016).
Article MathSciNet MATH Google Scholar
Criminisi, A. Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning. (2011).
Ahmad, M. W., Mourshed, M. & Rezgui, Y. Trees vs Neurons: Comparison between random forest and ANN for high-resolution prediction of building energy consumption. Energy Build. https://doi.org/10.1016/j.enbuild.2017.04.038 (2017).
Article Google Scholar
Ibrahim, I. A. & Khatib, T. A novel hybrid model for hourly global solar radiation prediction using random forests technique and firefly algorithm. Energy Convers. Manag. https://doi.org/10.1016/j.enconman.2017.02.006 (2017).
Article Google Scholar
Sun, H. et al. Assessing the potential of random forest method for estimating solar radiation using air pollution index. Energy Convers. Manag. https://doi.org/10.1016/j.enconman.2016.04.051 (2016).
Article Google Scholar
Rezaie-Balf, M., Kim, S., Ghaemi, A. & Deo, R. Design and performance of two decomposition paradigms in forecasting daily solar radiation with evolutionary polynomial regression: wavelet transform versus ensemble empirical mode decomposition. in Predictive Modelling for Energy Management and Power Systems Engineering, (2021).
Dietterich, T. G. An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization,” Machine Learning. (2000).
Mienye, I. D., Sun, Y. & Wang, Z. Prediction performance of improved decision tree-based algorithms: a review. Procedia Manufacturing 35, 698–703. https://doi.org/10.1016/j.promfg.2019.06.011 (2019).
Article Google Scholar
Singh, N., Jena, S. & Panigrahi, C. K. A novel application of decision Tree classifier in solar irradiance prediction. Mater. Today Proc. https://doi.org/10.1016/j.matpr.2022.02.198 (2022).
Article PubMed PubMed Central Google Scholar
Liu, C., Wang, J., Xiao, D. & Liang, Q. Forecasting S&P 500 stock index using statistical learning models. Open J. Stat. https://doi.org/10.4236/ojs.2016.66086 (2016).
Article PubMed PubMed Central Google Scholar
Singh, H. Practical Machine Learning and Image Processing. (2019).
Choi, S. H. & Hur, J. Optimized-XG boost learner based bagging model for photovoltaic power forecasting. Trans. Korean Inst. Electr. Eng. https://doi.org/10.5370/KIEE.2020.69.7.978 (2020).
Article Google Scholar
Hochreiter, S. The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. Uncertain. Fuzziness Knowlege-Based Syst. https://doi.org/10.1142/S0218488598000094 (1998).
Article MATH Google Scholar
Chen, G. A Gentle Tutorial of Recurrent Neural Network with Error Backpropagation, 1–10, (2016), [Online]. Available: http://arxiv.org/abs/1610.02583.
Rodrigues, P. C., Awe, O. O., Pimentel, J. S. & Mahmoudvand, R. Modelling the behaviour of currency exchange rates with singular spectrum analysis and artificial neural networks. Stats https://doi.org/10.3390/stats3020012 (2020).
Article Google Scholar
Notton, G., Voyant, C., Fouilloy, A., Duchaud, J. L. & Nivet, M. L. Some applications of ANN to solar radiation estimation and forecasting for energy applications. Appl. Sci. https://doi.org/10.3390/app9010209 (2019).
Article Google Scholar
Geetha, A. et al. Prediction of hourly solar radiation in Tamil Nadu using ANN model with different learning algorithms. Energy Rep. https://doi.org/10.1016/j.egyr.2021.11.190 (2022).
Article Google Scholar
Mukhtar, M. et al., Development and comparison of two novel hybrid neural network models for hourly solar radiation prediction, (2022).
Galvez, R. L., Bandala, A. A., Dadios, E. P., Vicerra, R. R. P. & Maningo, J. M. Z. Object Detection Using Convolutional Neural Networks,” https://doi.org/10.1109/TENCON.2018.8650517 (2019).
Zhang, Y., Ma, J., Zeng, C. & Li, G. Short-term global horizontal irradiance forecasting using a hybrid convolutional neural network-gate recurrent unit method. https://doi.org/10.1088/1742-6596/2025/1/012001 (2021).
Rai, A., Shrivastava, A. & Jana, K. C. A CNN-BiLSTM based deep learning model for mid-term solar radiation prediction, doi: https://doi.org/10.1002/2050-7038.12664 (2021).
Hasan, A. M., Jalab, H. A., Meziane, F., Kahtan, H. & Al-Ahmad, A. S. Combining deep and handcrafted image features for MRI brain scan classification. IEEE Access https://doi.org/10.1109/ACCESS.2019.2922691 (2019).
Article Google Scholar
Gu, J. et al. Recent advances in convolutional neural networks. Pattern Recognit. https://doi.org/10.1016/j.patcog.2017.10.013 (2018).
Article Google Scholar
Kutlu, H. & Avcı, E. A novel method for classifying liver and brain tumors using convolutional neural networks, discrete wavelet transform and long short-term memory networks. Sensors (Basel) https://doi.org/10.3390/s19091992 (2019).
Article Google Scholar
Singh, D., Kumar, V. & Kaur, M. Classification of COVID-19 patients from chest CT images using multi-objective differential evolution–based convolutional neural networks. Eur. J. Clin. Microbiol. Infect. Dis. https://doi.org/10.1007/s10096-020-03901-z (2020).
Article PubMed PubMed Central Google Scholar
Zegers, C. M. L. et al. Current applications of deep-learning in neuro-oncological MRI. Phys. Med. https://doi.org/10.1016/j.ejmp.2021.03.003 (2021).
Article PubMed Google Scholar
Chang, P. et al. Deep-learning convolutional neural networks accurately classify genetic mutations in gliomas. Am. J. Neuroradiol. https://doi.org/10.3174/ajnr.A5667 (2018).
Article PubMed PubMed Central Google Scholar
Ozcanli, A. K., Yaprakdal, F. & Baysal, M. Deep learning methods and applications for electrical power systems: A comprehensive review. Int. J. Energy Res. https://doi.org/10.1002/er.5331 (2020).
Article Google Scholar
Shi, X., Chen, Z., Wang, H., Yeung, D. Y., Wong, W. K. & Woo, W. C. Convolutional LSTM network: A machine learning approach for precipitation nowcasting, (2015).
E. Commission, “PHOTOVOLTAIC GEOGRAPHICAL INFORMATION SYSTEM (Typical meteorological year),” 2010. https://re.jrc.ec.europa.eu/pvg_tools/en/tools.html#TMY (accessed May 12, 2020).
SARAH, “EUMESAT CM SAF,” 2019. https://wui.cmsaf.eu/safira/action/viewDoiDetails?acronym=SARAH_V002_01 (accessed Mar. 09, 2021).
W. Bank, “The World Bank Data Catalog,” 2017. https://datacatalog.worldbank.org/search/type/dataset (accessed Dec. 05, 2020).
Meenal, R. & Selvakumar, A. I. Assessment of SVM, empirical and ANN based solar radiation prediction models with most influencing input parameters. Renew. Energy https://doi.org/10.1016/j.renene.2017.12.005 (2018).
Article Google Scholar

Download references

Acknowledgements

This study was supported by Sichuan Provincial Key Lab for Power System-Wide Area Measurement, Science and Technology Innovation Talent Program of Sichuan Provincial (Grant No. 22CXRC0010), and Science and Technology Innovation Talent Program of Sichuan Provincial (Grant No. 22CJDRC0025).

Author information

Authors and Affiliations

Sichuan Industrial Internet Intelligent Monitoring and Application Engineering Technology Research Centre, Chengdu University of Technology, Chenghua District, Chengdu, Sichuan, People’s Republic of China
Olusola Bamisile, Dongsheng Cai & Qi Huang
School of Software Engineering, University of Electronic Science and Technology of China, Chengdu, Sichuan, People’s Republic of China
Ariyo Oluwasanmi, Chukwuebuka Ejiyi & Chiagoziem C. Ukwuoma
IMDEA Networks Institute, 28918, Leganes, Madrid, Spain
Oluwasegun Ojo
Universidad Carlos III de Madrid, 28912, Leganes, Madrid, Spain
Oluwasegun Ojo
School of Economics and Management, Guangdong University of Petrochemical Technology, Maoming, 525000, China
Mustapha Mukhtar

Authors

Olusola Bamisile
View author publications
You can also search for this author in PubMed Google Scholar
Dongsheng Cai
View author publications
You can also search for this author in PubMed Google Scholar
Ariyo Oluwasanmi
View author publications
You can also search for this author in PubMed Google Scholar
Chukwuebuka Ejiyi
View author publications
You can also search for this author in PubMed Google Scholar
Chiagoziem C. Ukwuoma
View author publications
You can also search for this author in PubMed Google Scholar
Oluwasegun Ojo
View author publications
You can also search for this author in PubMed Google Scholar
Mustapha Mukhtar
View author publications
You can also search for this author in PubMed Google Scholar
Qi Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

O.B.: Conceptualization; Data curation; Formal analysis; Investigation; Methodology; Resources; Software; Supervision; Validation; Visualization; Roles/Writing - original draft; Writing - review & editing. D.C.: Methodology; Supervision; Funding acquisition; Validation; A.O.: Methodology; Data curation; Formal analysis; Investigation; Funding acquisition; Validation; C.E.: Methodology; Data curation; Formal analysis; Validation; C.C.U.: Methodology; Data curation; Formal analysis; Validation; O.O.: Methodology; Data curation; Formal analysis; Validation; M.M.: Methodology; Supervision; Validation; Q.H.: Methodology; Supervision; Validation;

Corresponding author

Correspondence to Dongsheng Cai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bamisile, O., Cai, D., Oluwasanmi, A. et al. Comprehensive assessment, review, and comparison of AI models for solar irradiance prediction based on different time/estimation intervals. Sci Rep 12, 9644 (2022). https://doi.org/10.1038/s41598-022-13652-w

Download citation

Received: 25 March 2022
Accepted: 17 May 2022
Published: 10 June 2022
DOI: https://doi.org/10.1038/s41598-022-13652-w

This article is cited by

The power of progressive active learning in floorplan images for energy assessment
- Dhoyazan Al-Turki
- Marios Kyriakou
- Mohammed M. Abdelsamea
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A multilayer perceptron neural network approach for optimizing solar irradiance forecasting in Central Africa with meteorological insights

Solar power forecasting beneath diverse weather conditions using GD and LM-artificial neural networks

A novel GBDT-BiLSTM hybrid model on improving day-ahead photovoltaic prediction

Introduction

Machine learning and deep learning algorithms

Random forest regression

Polynomial regression

Multi-linear regression

Decision tree regression

XG BOOST

Long short-term memory (LSTM)

Artificial neural network (ANN)

Convolutional neural network (CNN)

Hybrid CNN-ANN architecture

Hybrid CNN-LSTM-ANN architecture

Data Acquisition and Preparation

Training and testing of the models

Model implementation and evaluation metrics

Results

Daily average direct normal irradiance prediction

Hourly solar radiation forecast

Solar irradiance prediction based on minutes timestep

Brief summary and discussion

Conclusions

Data availability

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

The power of progressive active learning in floorplan images for energy assessment

Comments

Search

Quick links