Elsevier

Applied Soft Computing

Volume 66, May 2018, Pages 250-263
Applied Soft Computing

Investigating the impact of feature selection on the prediction of solar radiation in different locations in Saudi Arabia

https://doi.org/10.1016/j.asoc.2018.02.029Get rights and content

Highlights

  • A study of different feature selection methods is carried out to predict the daily amounts of solar radiation in different locations in Saudi Arabia.

  • Four feature selection algorithms are applied: ReliefF algorithm, Monte Carlo uninformative variable elimination algorithm (MCUVE), random-frog algorithm, and Laplacian score algorithm (LS).

  • A computational intelligence model of a multi-layer neural network is used as the predictor.

  • The results showed the importance of using feature selection methods in order to obtain a reliable prediction of the amount of solar radiation compared with using all the features available.

Abstract

Predictions about the future amount of solar radiation is an important factor that affects the planning and operating of solar energy projects. However, it is a difficult task due to the existence of a high level of uncertainty that is associated with unknown future weather conditions. Accordingly, the prediction process, especially for the short term, requires more attention to unveil hidden relationships and interactions between related variables. Since a large number of parameters affect the estimation and prediction processes, there is a need to apply an effective and efficient feature selection of the input feature space. In this paper, an investigation of different feature selection methods is carried out in order to predict the daily amounts of solar radiation in different locations in Saudi Arabia using a neural networks (NN) predictor. First, the selection of the most important variables is carried out using four different algorithms: ReliefF algorithm, Monte Carlo uninformative variable elimination algorithm (MCUVE), random-frog algorithm, and Laplacian score algorithm (LS). Then, a computational intelligence model of a multi-layer neural network is used as the predictor. The predictor aims to predict the next-day global horizontal irradiance using selected meteorological and solar radiation observations. The experimentation and results of the feature selection methods and the prediction process are described. The results showed the importance of using feature selection methods in order to obtain a reliable prediction of the amount of solar radiation compared with using all the features available.

Introduction

Solar energy is linked with the variability of different types of weather, topography, and geography. Such variability is considered to be one of the main obstacles that impacts energy availability and, thus, prevents the optimal exploitation of solar energy, especially in the short term. To best exploit solar energy, a reliable prediction model is crucial. For any location, a reliable source of solar radiation data is a requirement in many associated applications [1]. The prediction process is affected by the uncertainty about future weather conditions, which are linked to a large number of meteorological and solar radiation variables in different time horizons. Historical solar radiation and meteorological records can be used to build automated prediction models that do not require experts knowledge. Therefore, the prediction process, especially for short-term, is tough task for researchers and operators. Accordingly, many models of solar radiation have been presented in the literature. Two of the main tracks taken to develop prediction models are numerical weather prediction (NWP) and artificial intelligence (AI) models (see [[2], [3], [4]]). In order to obtain reliable prediction models, there is a clear need to reduce the large number of meteorological and solar radiation variables into a representative set that enhance the accuracy of the prediction model and reduce the data redundancy before applying the prediction model.

This paper focuses on the investigation of the impact of the feature selection process on the prediction of solar radiation. The proposed methodology uses four different feature selection algorithms to determine the most important variables. A neural network predictor is then used to predict the next-day solar radiation for eight locations in Saudi Arabia. The background for this study is discussed in Section 2. The methodology, the results, and the analysis are described in Sections 3 and 4. Finally, the conclusions and future work are highlighted in Section 5.

Section snippets

Background

The objective of this study is to investigate the impacts of using feature selection methods prior to predicting next-day solar radiation. In this section, we review the problem considerations and the methods applied in this study.

Methodology

The experiment is made up of four main stages: preparing the data sets, selecting the features, building the neural networks predictor, and testing the new data set using the predictor model. Testing process uses new data sets that are different spatially and temporally from the training data. The experiment stages are illustrated in Fig. 2 where neural network models are built with and without the four feature selection algorithms. The experiment is carried out in the MATLAB environment using

Results and discussion

The experiment was run 50 times using a Matlab environment. The average, maximum, minimum, and standard deviations of the training and testing errors have been calculated using different error metrics, including root mean squared error (RMSE), mean absolute error (MAE), and mean bias error (MBE). The errors metrics are defined as follows:RMSE=1ni=1n(f(x)fˆ(x))2MAE=1ni=1n(f(x)fˆ(x))MBE=1ni=1n(f(x)fˆ(x)),

where f(x) is the predicted value, and fˆ(x) is the actual value. Estimation error

Conclusion and future works

In this article, the use of different feature selection algorithms and neural networks to predict daily solar radiation in eight locations in Saudi Arabia has been presented. According to the experimental results for feature selection, the use of four feature selection algorithms produced the best prediction behavior for all locations compared with algorithms that used all the features. The improvements in RMSE were found to be between 19.9% and 52.1% compared with the prediction values when

References (48)

  • A. Hepbasli et al.

    A key review on present status and future directions of solar energy studies and applications in Saudi Arabia

    Renew. Sustain. Energy Rev.

    (2011)
  • A.A. El-Sebaii et al.

    Estimation of global solar radiation on horizontal surfaces in Jeddah, Saudi Arabia

    Energy Policy

    (2009)
  • H. Bulut et al.

    Simple model for the generation of daily global solar-radiation data in Turkey

    Appl. Energy

    (2007)
  • S. Rehman et al.

    Spatial estimation of global solar radiation using geostatistics

    Renew. Energy

    (2000)
  • S.H. Alawaji

    Evaluation of solar energy research and its applications in Saudi Arabia: 20 years of experience

    Renew. Sustain. Energy Rev.

    (2001)
  • M. Benghanem et al.

    ANN-based modeling and estimation of daily global solar radiation data: a case study

    Energy Convers. Manag.

    (2009)
  • M. Benghanem et al.

    A multiple correlation between different solar parameters in Medina, Saudi Arabia

    Renew. Energy

    (2007)
  • M.A. Ramli et al.

    Investigating the performance of support vector machine and artificial neural networks in predicting solar radiation on a tilted surface: Saudi Arabia case study

    Energy Convers. Manag.

    (2015)
  • S. Salcedo-Sanz et al.

    A CRO-species optimization scheme for robust global solar radiation statistical downscaling

    Renew. Energy

    (2017)
  • A.K. Yadav et al.

    Application of rapid miner in ANN based prediction of solar radiation for assessment of solar energy resource potential of 76 sites in Northwestern India

    Renew. Sustain. Energy Rev.

    (2015)
  • H.D. Li et al.

    Random frog: an efficient reversible jump Markov chain Monte Carlo-like approach for variable selection with applications to gene selection and disease classification

    Anal. Chim. Acta

    (2012)
  • K. Hornik et al.

    Multilayer feedforward networks are universal approximators

    Neural Netw.

    (1989)
  • M. Mohandes et al.

    Estimation of global solar radiation using artificial neural networks

    Renew. Energy

    (1998)
  • T. Khatib et al.

    A review of solar energy modeling techniques

    Renew. Sustain. Energy Rev.

    (2012)
  • Cited by (42)

    • Semi-real-time decision tree ensemble algorithms for very short-term solar irradiance forecasting

      2024, International Journal of Electrical Power and Energy Systems
    • The role of input selection and climate pre-classification on the performance of neural networks irradiance models

      2022, Applied Soft Computing
      Citation Excerpt :

      In [31] an Automatic Weather Station (AWS) located in Abu Dhabi International Airport and its data from 2004 to 2013 was used to reveal that the variables most affecting model precision are sunshine duration and temperature. Next-day solar radiation prediction using neural networks using three algorithms for feature selection in three locations in Saudi Arabia (Al-Uyaynah, Hafar Al-Batin, and Al-Qunfuthah) is developed in [5]. The authors found that the predictions based on feature selection algorithms demonstrated better prediction behavior than using all the data.

    • Deep learning CNN-LSTM-MLP hybrid fusion model for feature optimizations and daily solar radiation prediction

      2022, Measurement: Journal of the International Measurement Confederation
      Citation Excerpt :

      It first describes the SMA approach for feature selection and then describe the foundations of the CNN algorithm and the LSTM approach. In this study, a wrapper feature selection method [85–87] based upon a meta-heuristic algorithm called Slime Mould Algorithm (SMA) is firstly used to select the optimal features for GSR prediction. We have selected SMA based on its recent performance as a metaheuristic algorithm derived from the diffusion and foraging behaviour of slime mould [88].

    • Solar energy modelling and forecasting using artificial neural networks: a review, a case study, and applications

      2022, Artificial Neural Networks for Renewable Energy Systems and Real-World Applications
    View all citing articles on Scopus

    This study is part of a funded research project by King Abdulaziz City for Science and Technology (KACST) (Grant Number: 13-ENES2373-10). In addition, the author thanks King Abdullah City for Atomic and Renewable Energy (KACARE) for providing data.

    View full text