Abstract

A short-term forecast of energy consumption is affected by different factors related to the demand in residential, commercial, thermoelectric, and industrial sectors. This demand can be strongly constrained by weather variables, especially temperatures, whose forecast may be very useful to predict the balances between supply and demand, minimizing the risk of price volatility. Energy companies use the relationship between meteorological forecast output and energy request to provide an effective scheduling of national gas and power grids and reduce operational costs in critical periods. This work reports a comparison analysis for short- and medium-term daily temperature forecasts during the period 2013-2014 by using the weather model e-kmf™ (eni-kassandra meteo forecast), currently adopted in gas and power applications where meteorological output has a key role. This weather forecast system uses different models and initial data to develop probabilistic predictions from a perspective of eleven days ahead. In particular, a set of model runs with horizontal grid spacing of 5.5, 8, 13, and 18 km with the same domain size are undertaken to assess the sensitivity of temperature to horizontal resolutions. A nonlinear Kalman filter has been also applied to postprocess forecasted data in eight European cities (Milano, Roma, Torino, Napoli, Munich, Paris, Brussels, and London). Filtered forecasts over these cities have been compared to local observations taken from SYNOP (surface synoptic observations) and METAR (meteorological Aerodrome Report) stations. Skill scores of performance have been used to generally assess the forecast reliability up to day +11. In order to understand the sensitivity to the horizontal resolution, investigations have been carried out even during four specific periods of two weeks with stable and unstable weather conditions.

1. Introduction

Energy consumption is strictly correlated to seasonal weather and climatic trends, in particular, for surface temperature and precipitation [1, 2]. By knowing the temperature on a specific geographical area and giving attention to anomalous phenomena, it is possible to improve the planning of storage, sale, and supply of energy reserves [3].

Efficient operation of modern energy distribution systems often requires forecasting of future energy demand [4, 5]. Energy companies use the link between temperature variability and energy demand for supply, planning to guard against shortages during the most critical times. For this reason, they are the most active users of weather forecasts and they continue to use these products in their weekly scheduling. Hence, the benefit on short- and medium-term forecasts for energy services lies, first of all, in advance warning for better energy distribution and management. The present study, provided by Eni S.p.A to obtain an accurate and high reliability forecast tool for improving the management on energy market for national and European countries, has been made by using the patented e-kmf™ weather forecast system (eni-Kassandra Meteo Forecast). By providing short- and medium-term temperature forecast over different European areas of interest and using different horizontal grid resolutions to assess their impacts on the quality of prediction, an hourly weather forecast application for gas turbine power generation in Italy has been also carried out [6].

A former preliminary analysis over a short period in the year 2013 has shown how the change of the e-kmf™ model spatial resolution in the forecast from day + 1 to day + 5 might reduce the MAE of 2–4°C and improve the performance. In this study, several simulations across 2013 to 2014 have been performed with the e-kmf™ model by changing its horizontal resolution and quantifying its impact in terms of computational cost and time to run the e-kmf™ model with different spatial grids. The benefits in energy market even in many European cities with a different climate compared to the Mediterranean one in Italy have been evaluated.

To assess the impacts of horizontal grid on the forecast results in the selected areas, vertical resolution and physics have been maintained the same for all simulations (as the daily operative 5.5 km model), with a forecast horizon up to day + 11. Temperature forecasts scaled on selected cities in Italy (Milano, Torino, Roma, and Napoli), Germany (Munich), Belgium (Brussels), France (Paris), and UK (London) have been compared against observations coming from SYNOP (surface synoptic observations) and METAR (meteorological aerodrome report) stations for a dataset of two years (2013-2014) and, then, for particular periods of two weeks with stable and unstable situations. After the description of the data analysis method in Section 2 and the e-kmf™ forecast system in Section 3, we analyse and discuss the weather benchmark dataset used in this paper. Finally, conclusions of this paper are reported in Section 5.

2. Observed and Forecasted Temperature Data

The area of study includes four Italian cities (Milano, Torino, Roma, and Napoli) and four European cities (London, Brussels, Paris, and Munich). Figure 1 shows the domain of the e-kmf™ weather forecast model for the selected European cities (green dots).

The e-kmf™ model forecasts for each urban centre of the eight selected cities at different spatial resolutions are compared with observed data taken from SYNOP and METAR stations in order to calculate the performance of forecasts at different lead times and for different spatial resolutions. The procedure to spatially and temporally average daily temperature forecasts from raw GRIB (Gridded Binary) files at a given lead time for each analysed city is afterwards described:(i)Hourly temperatures of the four closest grid points (, , , and ) of the meteorological model for each i-th analysed city are selected (Figure 2), where , , , and are the hourly temperatures of the north-west, north-east, south-west, south-east, respectively, grid point closest to the city.(ii)These four hourly temperatures of the four grid points closest to the selected city are averaged out with a simple inverse distance weighting (IDW) method (equation 1) to calculate a hourly mean temperature forecast over the city ():where , , , and are the square values of the distance (in km) between the north-west, north-east, south-west, south-east, respectively, grid points closest to the city and the city itself.(iii)The final step computes the daily temperature forecast () of the analysed city averaging out the hourly mean temperatures () as shown in the following equation:  forecast of the daily mean temperaturewhere n is equal to 24 for the e-kmf™ model since the model forecast output () is every 1 hour.

This aforementioned procedure is repeated for all the 8 cities at all lead times (eleven) of the model forecast (day + 0, day + 1, day + 2, etc.).

3. e-kmf™ Meteorological Model

In this study a multimodel approach based on regional and limited area models has been used [6, 7]. This local ensemble prediction system has been developed in the framework of the e-kmf™ weather forecast system where the forecast is produced by multiple modeling sources with varying physics and perturbed initial conditions. Ensemble forecasting provides a useful way of addressing variability in the initial conditions, uncertainties in model physics, and the inherent uncertainty in atmospheric forecast [3]. This system represents an effective attempt to simulate the atmospheric evolution and may provide insights into the range of uncertainties which can be found in both the initial conditions and models. Multimodel approaches, which use a single analysis, attempt to overcome modeling system deficiencies, while multiple analyses with a single modeling system may diagnose sensitivity to the initial conditions. The combination of these two approaches can maximize the benefits of each one by compensating for deficiencies in both the initial conditions and the modeling systems. The e-kmf™ consists of multicomponent ensembles: the global ensemble provides lateral boundary conditions to a regional ensemble which provides the boundary conditions to a limited area. In this setup, the regional ensemble provides the downscaling of the global ensemble; the local ensemble, in turn, provides the dynamical downscaling of the regional ensemble. Each higher resolution model provides the opportunity for small-scale features to grow in the ensemble. The local ensemble used in this study is applied on a domain covering the area in Figure 1, using 14 members (the same number of the regional ensemble), a spatial grid resolution varying from 5.5 km to 18 km with 42 vertical levels in all simulations, and the same number of levels of the global and the regional models. Dynamical cores are based on the WRF-ARW (Weather Research and Forecasting-Advanced Research), the WRF NMM (nonhydrostatic mesoscale model), and the latest version of Eta model [8]. For each simulation, the multimodel approach uses different physical and dynamic schemes for microphysics [9, 10], planetary boundary layer and surface layer [1116], cumulus parameterization [17, 18], radiation [1921], and land surface physics [2225].

Each run is then postprocessed through a Kalman filter to obtain local temperature predictions; the operative forecast output is the median temperature computed on these local filtered runs [6]. This e-kmf™ forecast system is suitable for several applications where forecast of meteorological variables is relevant (e.g., renewable energy, risk analysis, and planning), and it can be applied to any area worldwide.

3.1. Importance of the Horizontal Grid Resolution

The horizontal resolution of a NWP (numerical weather prediction) model is related to the spacing between grid points for grid point models or the number of waves that can be resolved for spectral models. Since the smallest features that can be accurately represented by a model are several times larger than the grid resolution, phenomena with dimensions on the same scale as the grid spacing are unlikely to be depicted or predicted within a model. Whether a model is considered at high or low resolution depends upon the size of the domain and the scale of weather phenomena that the model is trying to predict. A resolution on the order of 10 to 20 km may be considered high for a global model, while for a storm-scale model, a resolution of 500 m may be necessary in order to resolve the internal processes. Horizontal resolution has a direct impact on several characteristics such as vertical motions, downward fluxes, precipitation location and strengths, down-slope and valley winds, and mountain waves, among others.

Quite large grid meshes cannot resolve some characteristics of the circulations and, particularly, in complex areas, the vertical wind velocities, which do not affect only dynamics, but the thermal and moisture distribution, which has an influence on the wind characteristics and the occurrence and strength of precipitation. High-resolution mesoscale simulations can get better results to describe the circulation patterns, the surface parameters, and the precipitation amounts. Several years ago, in the framework of an integrated modeling approach at regional and local scales, simulations were performed in the Alpine area [26] and results about precipitations fairly well agreed with observations when horizontal resolutions fell below 10 km. Hence, higher resolutions were crucial to improve the location and the values of the precipitations. The simultaneous use of mesoscale models gave a further improvement in the description of weather dynamics and thermodynamics [27]. Many of the deficiencies found in the current global- and synoptic-scale models can therefore be mitigated in mesoscale and local models. The use of vertical coordinates and the vertical resolution also strongly impact the nature of terrain effects.

Many other studies showed the importance of higher resolution in improving the forecast skill [28], in different research fields as convective precipitation [29], fire danger [30], weather prediction, or climate simulations [31, 32]. Furthermore, horizontal resolution has a direct impact on the convection schemes, and convection-permitting models may be required when it is below 3 km. Then, an increased resolution will generally yield more accurate simulations due to the reduction in the dispersion of subgrid-scale phenomena and increasing coverage of wavenumber space, particularly in regions that are topographically complex and exhibit heterogeneous land use. However, increased resolutions are not necessarily associated with an increased model skill or a reduction in model biases [33], and it also comes at substantial computational cost [34].

Finally, even simulations with an identical model running at different resolutions may lead to different behaviours due to nonlinearities in the parameterizations; any change in model behaviour and performance might be parameter specific. For example, some studies have shown that increasing horizontal resolution leads to an increase of quantitative precipitation forecast (QPF) especially over complex topography, which can be dependent to convection, and the impact of model domain size and horizontal resolution on QPF may be quite large comparing with the perturbed physics and initial condition uncertainty [29]. The model skill at small scales is found to be better for stable orographic precipitation than in the presence of embedded convective cells because the latter induces a stochastic component in the precipitation field. The impact of the cloud microphysics scheme increases with increasing model resolution, and in the complex topography areas, the impact of enhancing the model resolution turns out to be small and not necessarily beneficial [35]. On the other side, a near-surface parameter like temperature might show a different behaviour depending on the model’s ability in forecasting the structures in the lower-atmospheric boundary layer, the PBL schemes used in the model, and the flow-dependent features in complex terrain [36], as it is also described in [37], where the overall accuracy of temperature forecasts for the highest resolution grid domain did not fully capture the nocturnal and persistent cold pools within mountain valleys and lowlands.

In this paper, the objective is to evaluate the model performance at different spatial grid resolutions (Figure 3) for the 2 m air temperature over selected European cities which belong to different climatic conditions.

4. Results and Discussion

4.1. Benchmark Analysis

Mean daily temperature forecasts for each of the eight European cities of the e-kmf™ model at different spatial resolutions are compared with observed data to calculate the performance evaluation at different lead times. Observed temperature data are available for the years 2013 and 2014, when particular weather periods are also selected to be analysed over the selected cities.

This fortnight is referred to stable and unstable time, where stable means that the meteorological pattern may be generally referred to a presence of a high pressure at synoptic scale with limited temperature changes between the following days, while unstable is referred to a relatively occurrence of low pressure or changeable weather due to the passage of fronts and instabilities with significant variations in temperatures among consecutive days.

Some statistical indexes, commonly known in literature, are used to set up the analysis. Among skill scores, the mean absolute error (MAE) is the one which efficiently represents the magnitude of forecast error in a given verification data set, and it is also daily used by Eni operators for energy and gas planning models. Only the ensemble mean will be used in this paper since this analysis is related to the operational application of the energy consumption model, which can use a single value of temperature. However, the mean of the ensemble scenarios is generally the most accurate forecast [38], and it is more performing than a high-resolution deterministic model prediction.

4.2. Role of Grid Spatial Resolution

To only evaluate impacts of the horizontal grid on the results in the selected areas, vertical resolution (48 levels) and physics have been maintained the same for all simulations. Forecasts scaled on designated cities in Italy, Germany, Belgium, France, and UK are compared at all lead times (11 days in advance) against observations coming from the SYNOP and METAR stations. Statistics describing performances obtained in the various simulations are shown in Tables 14.

In terms of MAE, higher resolutions appear more important for cities in Italy (Table 1) than for cities in Europe (Table 2). This is reasonable due to the presence of a complex topography all around Italian cities (mainly Milano and Torino which are in the Po Valley close to mountains and less for Napoli and Roma, where the maritime climate mitigates the daily variations of temperatures), while for European cities of London, Paris, and Brussels, there are no relevant orographic characteristics around; a partial exception is Munich, in Germany, due to the presence of Alps, southerly to this place.

It appears that the transition from 18 to 5.5 km grid spacing allows a better definition of the main mesoscale features; in fact, differences in the horizontal grid resolution mean differences in mean model orography. In addition, local fluxes may also depend on the subgrid representation of topography. However, the representation of the mean orography appears to play an important role for the prediction of near-surface temperatures. This is not surprising, since the low-level atmospheric spectra closely follow the mean orography spectra [39]. The degradation in forecast skill resulting from using a smoother mean orography can only partially be alleviated by using more variability in the subgrid orography, and the parameterized drag does not affect the flow in the same way as the resolved drag. 2 m temperatures of the cities near relevant mountains (Milano, Torino, but also Munich) are, then, influenced by the nearby topography, especially, when the interaction between synoptic and mesoscale fluxes with the mountains gives an important contribution to the variability of the near-surface variables.

Forecasts of near-surface variables in flat terrain depend on the predictability of the lower-atmospheric boundary layer. In complex terrain, forecasts not only suffer from the model’s inability to reproduce accurate atmospheric conditions in the lower atmosphere but also struggle with representative issues due to mismatches between the model and the actual terrain. In addition, surface forecasts at finer resolutions do not always outperform those at coarser resolutions (see, for instance, the MAE at 5.5 and 8 km, for Torino).

An improvement in the MAE of predicted daily mean temperatures has been obtained with errors generally below 2°C (italicized numbers in Tables 1 and 2) in the overall forecasted period. This MAE threshold may be reasonably considered a focal target for the temperature prediction by a meteorological model. In contrast, there are only small improvements in verification statistics as grid spacing decreased from 8 to 5.5 km. Reducing grid spacing provides more detail and structure (e.g., defining steeper orographic slopes), but it has only a limited impact on traditional objective verification scores in the points analysed here. However, in this application, the verification is also necessarily dependent on the needs of the end user; the e-kmf™ model, with 18 km grid resolution, might be suitable for the prediction of near-surface temperatures by the energy consumption application over cities mainly dominated by Atlantic patterns over a week ahead from the initialization date. The same model resolution depicts an analogous performance only until the fourth day over areas where terrain and surface contrasts may have a significant interaction with mesoscale features. At the end, the 8 km horizontal resolution may result as the best compromise in terms of both computational time and relatively low error scores (<2°C) and in terms of the application to energy consumption models.

In addition to the MAE, which gives an idea of the average magnitude of the forecast errors, the RMSE (root mean square error) puts in evidence the biggest temperature errors at the different lead times. This may be a useful performance analysis parameter, although it does not give information about deviations as the ME (mean error). Concerning the ME analysis (not presented here, for the sake of brevity), it exhibits a general underestimation over Italian cities, while an opposite behaviour is found over European ones. This outcome needs to be further investigated. Afterwards, the RMSE performance over Italian (Table 3) and European cities is shown (Table 4).

Results presented in the Tables 14 faithfully follow the MAE behaviour, particularly for Italian cities. It clearly appears, as above described, how large errors are enhanced from day+7 onwards at all grid resolutions and from day+5 for coarser ones (13 and 18 km). Particularly, it is interesting to note the performance at 18 km grid resolution for Torino, where the presence of the surrounding Alps likely induces errors beyond the 2°C threshold from day+2. For European cities, instead, a better performance is overall confirmed although some flaws are even found from day+5 for the lower grid resolutions. Again, the 8 km grid resolution represents the optimal choice for all the analysed cities in terms of performance and computational costs, considering the model runs several times a day.

4.3. Stable and Unstable Weather Condition Periods

The abovementioned study highlights general results over a period of two years. A subsequent deeper investigation is carried out using the 2 m temperature variable over particular periods characterized by opposite weather pattern configurations in order to evaluate if the e-kmf™ performance is enhanced or smoothed at different grid resolutions.

The following steps describe the adopted methodology chosen for selecting stable and unstable periods through temperature data only. First of all, we calculated the daily mean temperature (Tdaily mean) from observed hourly temperature data (original dataset), then the daily temperature differences between day+0 and day1 (T deviations); then, we calculated the weighted mean of T deviations on a moving window of 14 days (T-weighted mean), and finally, the highest value of T-weighted mean has been associated to an unstable period, while the lowest value to a stable period for each city, as shown in Figure 4 for the example of Milano city.

Two stable periods in winter and summer and two unstable periods in spring and autumn were selected accordingly to a similar synoptic pattern found over the eight cities at the same time and in accordance with the calculations of the weighted temperature mean.(i)1st unstable period: 14–27 April 2013(ii)2nd unstable period: 28 October–10 November 2013(iii)3rd stable period: 23 January–5 February 2014(iv)4th stable period: 23 June–7 July, 2015, where the presence of the massive high pressure in the summer 2015 over the greatest part of Europe let to choose a period (out of the original dataset) which was mutually stable over all the eight Italian and European cities.

For the sake of brevity, in the following pictures, we show the comparison for the 1st (unstable) and the 4th (stable) periods for the eight cities and for all lead times at different grid model resolutions.

It clearly appears how the spread between the MAE for Italian and European cities is confirmed here, and, even wider, worsening the grid resolution model (Figure 5). Forecast errors are between 2 and 4°C from the sixth day onwards during the unstable period and they can reach even 5°C as temperature deviation with a coarser resolution model.

On the contrary, during the presence of the July 2015 anticyclone which embraced the whole of Europe, the performances of the e-kmf™ are higher, as logically one can expect. The MAE remains below the 2°C error threshold event at lower model resolution (Figure 6), expect for Torino and Milano where local topography plays a crucial role, and the temperature forecast errors reach 4°C and 5°C as differences with local observations with the 13 km and 18 km grid resolution.

Considering these selected periods, the presence of synoptic-scale high pressure generally increases the forecast horizon reducing the impact of grid resolution. On the other side, in unstable weather conditions, generally responsible of abrupt changes in observed temperatures, the MAE increases as grid resolution decreases, especially for places where influences of the terrain may become relevant.

5. Conclusions

The analysis about temperature forecast errors has been carried out over eight cities among Italian and European cities during the years 2013 and 2014 and, afterwards, for four periods of two weeks with different synoptic conditions. The MAE and RMSE are used to calculate the performance analysis between observed data and meteorological output of the e-kmf™ weather forecast model at different spatial resolutions (5.5, 8, 13, and 18 km).

The aim of the study is to analyse the role of spatial resolution on temperature forecasts of the e-kmf™ model for those sites characterized by different climate and terrain conditions as tested by the four European cities (London, Paris, Brussels, and Munich) and the four Italian cities (Milano, Torino, Roma, and Napoli). Results on the e-kmf™ spatial resolution analysis show the role of grid size on model performances at any lead time (up to day+11). In particular, the spatial resolution change from 18 to 5.5 km is more relevant over the Italian (Mediterranean) cities than the European (continental) ones, since these are affected by Atlantic climate conditions. A temperature score lower than 2°C (MAE and RMSE), which can be considered an acceptable threshold for an effective temperature forecast in energy consumption applications, is achieved using the forecast model with 5.5 km grid resolution up to day+5 in Milano and Torino, up to day+6 in Napoli and Roma, and up to day+9 in Brussels, Paris, and London. RMSE analysis for Torino (close to the Alpine region) exhibits the lowest performance score at the coarsest resolution model (18 km) already since the day+2 forecast, showing the importance of the surrounding terrain characteristics. For an operative application over both Italian and European cities by minimizing the computational costs, a proper grid size might be 8 km. Finally, considering that it is used as a single daily value in the energy consumption operational prediction model, this benchmark analysis has only looked at the ensemble mean value of temperatures. A future development will account for each forecast ensemble member individually in order to generate a probability distribution for energy consumption predictions.

Abbreviations

e-kmf™:eni-kassandra meteo forecast
IDW:Inverse distance weighting
MAE:Mean absolute error
METAR:Meteorological aerodrome report
NWP model:Numerical weather prediction model
SYNOP:Surface Synoptic observations.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest (neither financial nor nonfinancial ones).

Authors’ Contributions

AC was involved in the benchmark analysis and data interpretation and wrote most of this manuscript. GG was involved in research project development, industrial applications, and revision. RS developed the weather forecast model, collected all observed datasets, and supervised all data simulations. GE was involved in data model analysis and debugging. MM supervised the complete manuscript and approved the final version to be published.

Acknowledgments

This research was carried out in the framework of the Kassandra Meteo Project funded by Eni S.p.A.