Abstract
Water scarcity and drought management is the burning issue in India and hence needs serious attention of researchers to develop rigorous plan and management. Areas that belong to various plateaus, e.g., Chotanagpur plateau, Deccan plateau, etc., are mostly affected by drought in India. In the past decade, Bankura District of West Bengal, which belongs to northeast part of Chotanagpur plateau, faced severe drought several times. However, the assessment of drought scenario in this area is far from conclusive statement till date. In this paper, we simulate standardized precipitation index (SPI) using double exponential (DE) and Holt–Winter exponential smoothing model (HW) for several time steps (e.g., 3 months, 6 months, 12 months, 24 months and 48 months) in the time period of 1979–2014. The comparative analysis between two models indicates that DE is more accurate one. DE is observed with relatively low root mean squared error (RMSE) and high R2 value. Furthermore, drought-prone zones are demarcated using combined scores of principal component analysis (PCA) and those combined scores are estimated using actual, HW and DE simulated SPI in several time steps. At the shorter (3 and 6 months) and longer time step (12, 24 and 48 months), the PCA demonstrates almost same results. The western and northwestern blocks of the district are severely affected by drought, and the southern portions are at mild condition. Spatially distributed RMSE in every time steps is also high in northwestern portions of the study region. Our result may be useful to understand the pattern of drought to take necessary action in management of water resources in Bankura District, West Bengal. Moreover, the study uses an unique methodology to simulate and assess meteorological drought, which is applicable in any region of the world.
Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Avoid common mistakes on your manuscript.
1 Introduction
Drought can be defined as the prolonged period of dry conditions [1, 2], and it is one of the most complicated and least understood natural hazards. It is one of the natural disasters that human being has suffered since the ancient era [3,4,, 4], and it is the costliest, recurrent natural disaster [5]. A meteorological drought can be expressed as a result of a precipitation shortage or as a lack of precipitation over a region for a period of time [4]. According to the Olukayodo [6], meteorological drought is one of the most essential and primary drought types that need to be addressed with care and attention. Meteorological drought simulation is a critical element in drought risk management [7]. Simulation models express the real behavior of a particular phenomenon needs to address appropriately to know the actual nature of a specific system [8]. Simulation of meteorological drought is more important than prediction to understand drought-prone scenarios effectively [9]. Mishra et al. [10] asserted that the accurate assessment and simulation of drought are essential for proper planning and management of water resources. Drought preparedness and mitigation depend upon the large-scale drought monitoring and simulation over a particular geographical area [11]. Many drought simulation models are already developed in the field of civil engineering. Mishra and Desai [12] developed ARIMA and multiplicative seasonal ARIMA models to forecast drought using SPI series. Zhang et al. [13] simulated SPI using the ARMA–GARCH model in China. Habibi et al. [14] simulated SPI12 by applying the Markov chain model. Morid et al. [15] simulated drought using artificial neural network (ANN) using effective drought index (EDI) and standard precipitation index (SPI). Mishra and Desai [16] simulated drought up to 6 months lead time by using and comparing linear stochastic models with the recursive multistep neural network. Barros and Bowden [17] applied self-organizing maps (SOM) and multivariate linear regression analysis to simulate SPI (in 12 months lead time) of Murray–Darling basin of Australia.
Mishra and Desai [12] argued that exponential smoothing techniques are the systematic empirical methods for simulation and forecasting of meteorological drought with a relatively higher accuracy rate. Holt and Winter originally developed exponential smoothing in the year of 1960, but later it was significantly modified by Kalehar [18]. Raha and Gayen [19] innovatively used exponential smoothing models and made realistic predictions of the drought of Bankura District, West Bengal [19]. They also argued that exponential smoothing might be a leading model in simulation and prediction of drought phenomena though seriously neglected in the framework of drought simulation. Exponential smoothing models have seasonal adjustability, and they can perform more realistic simulations than other statistical models [12, 20]. Forecasts produced by exponential smoothing methods are the weighted average of the past observations (Hyndman et al. [21]). In other words, the more recent observation gets higher associated weight. This framework generates reliable forecasts quickly for the wider range of time series, which is a great advantage (Billah et al. [22]). Raha and Gayen [19] stressed over the traditional exponential and Holt–Winter exponential smoothing, and also they have only considered 12 months time step in their research. They have failed to judge the seasonal flexibility of exponential smoothing. The present study simulates drought in different time steps using exponential smoothing and also comparatively assesses different exponential smoothing procedures in drought simulation and modeling. Station-wise assessment of intensity and occurrence rate of meteorological drought is the most necessary parameters of realistic simulation [23,24,25,26,27]. To visualize the drought phenomena effectively, spatial interpolation method IDW is applied. This study attempts to simulate drought (in 1 month lead time) using two exponential smoothing models in the time period of 1979–2014. Thus, this study has following objectives:
-
Simulation of meteorological drought at 1 month lead time using Holt–Winter and double exponential smoothing models
-
Comparative assessment between double exponential and Holt–Winter exponential method in the simulation of meteorological drought.
There is increasing evidence that climate change will affect West Bengal, and especially drought will be one of the most challenging issues for Bankura [28−30]. Mishra and Desai [16] forecasted and analyzed drought in Kangsabati River Basin using an artificial recursive neural network. Markov chain model was applied to estimate dryness of Purulia District by Banik et al. [30]. Lohar and Pal [31] showed that the mean monthly pre-monsoonal rainfall has decreased, and the temperature has increased significantly in the last decades of the twentieth century. According to Ghosh, [32] Bankura and its associated regions are expected to receive less rain in monsoonal season [33]. Bankura is likely to experience a 1ºC rise in average temperature during 2025–2099 [32−33]. Over the past few years, the impact of climate change has felt severely in Bankura [34,35,, 35]. Delay in arrival of monsoon season is observed in Bankura and its associated tract [32]. It is also noticed that summer becomes long, and drought has become more frequent [2, 36]. The problems are further being compounded with the growing population, lack of water resources and adaptation with water-intensive commercial crops. The Bankura, including Gangetic West Bengal (GWB), is less experienced in coping with drought. In such a pandemonium, spatiotemporal simulation of drought of Bankura is a good attempt.
Figure 1 denotes the location map of the study region. Bankura is an administrative division surrounded by Purba Bardhaman and Paschim Bardhaman District in the north, Purulia District in the west, Jhargram and Paschim Medinipur in the south. Although the average rainfall of Bankura District is 1400 mm, but much rainfall happens for the months of June to September [27]. Hot westerly winds prevail in Bankura from March to June in Bankura in the study area [36]. The study region becomes geographer’s attraction for last five to seven years due to excessive drought proneness-, poverty- and migration-related scenarios [27].
2 Datasets and methodology
Table 1 determines the list of meteorological stations with latitude, longitude and elevation (m). Rainfall data from 1979–2014 on daily bases downloaded from the official website of Climate Forecast System Reanalysis (CFSR) in SWAT format. According to Dile and Srinivason [37], NCEP dataset is one of the most widely used trusted datasets that can easily be used in drought simulation and monitoring purposes. Here, daily rainfall data have been converted to monthly rainfall values. Figure 2 denotes the location of meteorological stations marked within the Bankura District, West Bengal. Figure 3 signifies the overall methodological framework of this research. From rainfall data, SPI has been estimated in 3, 6, 12, 24 and 48 months time frame. Peak drought intensity (lowest SPI value), extreme to severe and moderate drought occurrence rate are assessed using observed and simulated models at above mentioned time steps. Hazard zones are also identified based on PCA score generated using three models.
2.1 Determination of drought
SPI was developed by Mckee et. al. [38] to monitor drought. The responsiveness of precipitation deficits of SPI was found more reliable within shorter and longer time steps [39]. Because of simplicity of calculation, ability to address different drought-related issues at a glance, SPI is found most suitable and reliable can be applicable in different parts of the world [40,41,42]. Calculation of SPI should be based on different time steps to denote drought effectively [43,44,, 44]. Here, classification of SPI is based on 3 months, 6 months, 12 months, 24 months and 48 months time step. Three months and 6 months are considered as short term and 12, 24, and 48 months time frame is considered as long-term time frame. The analysis of SPI is performed in Meteorological Drought Monitoring (MDM) software prepared by Agrimetsoft team [44]. This software is a critical tool for calculating and comparing drought in multiple locations at different time steps. Salehnia et. al. [45] and agrimetsoft.com express detailed procedure of calculation of SPI through MDM software as follows:
After arranging the station-wise rainfall data (x) in column format, that data are incorporated into the MDM software. Using the input data, deviation of total rainfall (x) from long-term rainfall mean \((\overline{x})\) is estimated by the software. After completion of the process, that total deviation is divided by the standard deviation of rainfall (\({\updelta })\) which is basically as follows [45,46,, 46]:
The long-term rainfall is then fitted to the probability distribution and then transformed into the normal distribution to the mean SPI for the location and the desired period is zero [36],
where a is the individual gamma distribution, M is mean, and δ is the standard deviation of rainfall.
Based on Mckee et. al. [38], drought severity classes are identified in Table 2.
2.2 Exponential smoothing models
Exponential smoothing is a technique to smooth time series data using the exponential window function. Exponential smoothing technique assigns decreasing weights over time. This framework generates reliable forecast quickly and has a great advantage in real-time simulation [44]. The two basic exponential smoothing models are taken in this study which is as follows:
2.2.1 Holt–Winter exponential smoothing
Holt–Winter exponential smoothing is a statistical technique which is used here to simulate SPI at 3, 6, 12, 24 and 48 months time steps. It provides effective way of simulation of time series data. This method is efficient to capture seasonality [47]. There are three weights of smoothing constants, viz. α, β, Y representing the level, trend and season, respectively, used to upgrade component for each period of time, t. The value of these constants lies within 0 and 1. This value is selected depending on the weight (high smooth constant mean ensures more weight). The initial value taken in this research is 0.2. Holt–Winter model is applied in this research using following equations [47]:
Level:
Trend:
Seasonal:
Fitted:
where Lt is the level, bt is the trend, St is the season, Yt is the SPI, and t is the time period of Lt, bt, St and Yt components. Ft is the forecast value ahead of one period, α, β, Υ level, trend and seasonal smoothing coefficient estimators, respectively, St is the seasonal duration. K is the integer part of (h − 1)/m, which ensures the estimates of seasonal indices, used for forecasting, come from the final year Y, i.e., here SPI. The level equation shows a weighted average between seasonally adjusted observation \(\left( {Y_{t} - S_{t - m} } \right)\) and the non-seasonal forecast \(\left( {L_{t - 1} + b_{t - 1} } \right)\) for time t. The initial value of seasonal component s1 is determined using Eq. (7),
The Holt–Winter model is also known as Winter’s triple exponential smoothing [47] as this method was originally developed by Winter. In this method, the seasonal component is expressed in absolute terms with the scale of the observed series and in the level equation is seasonally adjusted by subtracting the seasonal component. Within each year, the seasonal component will add up approximately zero. Here, Yt − Lt in Eq. 5 is the weighted average of seasonally adjusted observation of SPI. Non-seasonal simulated term is \(L_{{{\text{t}} - 1}} + b_{t - 1}\) for time t. This method has great adjustability in interpreting seasonal and non-seasonal behavior [44].
2.2.2 Double exponential smoothing
Double exponential smoothing is applicable when the data show a trend. In this method, seasonal component and trend component are updated in each period [47, 48]:
Simulated:
Seasonal:
Trend:
Here, \(S_{t}\) is the weighted average observation of Yt. The one-period-ahead simulation is given by \(S_{t} + hb_{t}\). Equation (11) shows that bt is the weighted average of estimated trend at time t. The h-step-ahead simulation is equal to the last estimated level plus h times the last estimated trend value. Thus, Eq. 9 is the linear function of h.
2.3 Drought evaluation parameters—Intensity and frequency
Wu et. al [39], Ghosh [27] and Hayes et. al. [49] identified intensity and occurrence rate as the basic ingredients of drought. Statistical models may be applied to simulate SPI in different time steps. Comparative assessment of different statistical models is one of the best keys in spatiotemporal simulation of drought (Hayes et. al. [50], Ghosh [51], Majumder et. al. [52], Touma et. al. [53], Chen et. al. [54]). Welner and Santner [55], Jeong et. al. [56] and Aryal and Zhu [57] analyzed changing pattern of drought frequency and intensity in different time steps. The present research utilizes peak intensity and occurrence rate as the drought evaluation parameters:
2.3.1 Peak intensity (PID)
ID denotes departure of a climate index from its normal value (Thomposon [58]). Here, drought event is defined as a period in which SPI is continuously reaching a value toward − 0.99 or less. Here, PID denotes the lowest value of SPI which is simulated by observed and exponential smoothing models at different time steps. Lesser the value (− 0.99 or less) more will be the drought intensity.
2.3.2 Occurrence rate of drought (FD)
FD is used to assess the drought liability during the study period (Wang et. al. [59]). The number of droughts per 35 years is calculated as:
where \(F_{Dj \cdot 35}\) is the frequency of droughts for timescale j in 35 years, \(N_{j}\) is the number of months with droughts for the timescale j in the n year set, j is the timescale (e.g., here, 3, 6, 12, 24 and 48 months), and n is the number of years in the dataset. FD is also known as the frequency of drought.
Drought intensity and occurrence rate are estimated using observed and simulated SPI in 3, 6, 12, 24 and 48 months time steps, respectively.
2.4 Drought risk zone estimation using PCA
Principal component analysis (PCA) is one of the most effective and popular techniques to estimate drought risk zones (Cai et. al. [60], Dinpashoh et. al. [61]). PCA is a multivariate technique that reduces the dimensionality of the dataset and computes a set of new orthogonal variables in decreasing order (Demsar et. al. [62]). The technique of Joliffe [63] has been followed in this research, to estimate principal components. In this study, principal components are estimated in 3 months, 6 months and 48 months time steps. Here, the technique of Joliffe 2002 is used to calculate principal components using intensity and frequency of drought in different time steps. Here, it is assumed that x is a vector of p random variables. In this study, PCA is concerned with the correlation and covariance. A linear function \(\alpha^{\prime } x\) of the elements of x having maximum variance where α1 is a vector of p constants α11, α12,…, α1p, and ′ denotes transpose, so that
Next, a linear function \(\alpha_{2}^{\prime } x\) uncorrelated with \(\alpha_{1}^{\prime } x\) having maximum variance, and so, at the k-th stage a linear function \(\alpha_{k}^{\prime } x\) is found that has maximum variance subject to being uncorrelated with \(\alpha_{1}^{\prime } x,\alpha_{2}^{\prime } x, \ldots \alpha_{k - 1}^{\prime } x\). The k-th derived variable, \(\alpha_{k}^{\prime } x\), is the k-th PC. Here, \(\alpha_{1}^{\prime } x\) the vector \(\alpha_{1}\) maximizes such that \({\text{var}} \left[ {\alpha^{\prime}_{1} x} \right] - \alpha_{1}^{\prime } \sum \alpha_{1}^{\prime } = 0\). To maximize \(\alpha_{1}^{\prime } \sum \alpha_{1}^{\prime }\), here the Lagrangian multipliers are used. \(\alpha_{1}^{\prime } \sum \alpha_{1}^{\prime } - \gamma \left( {\alpha_{1}^{^{\prime}} \sum \alpha_{1} - 1} \right)\) is maximized where \(\gamma\) is the Lagrangian multiplier. With respect to \(\alpha_{1}\), differentiation of the function \(\sum \alpha_{1} - \gamma \alpha_{1} = 0\) is done. Now, here \(I_{p}\) is considered as the identity matrix \(\sum \alpha_{1} - \gamma I_{p} = 0\) where \(\gamma\) is the eigenvalue of ∑ and \(\alpha_{1}\) is the corresponding eigenvector. To decide which of the p eigenvectors gives maximum variances, the condition \(\left[ {\alpha^{\prime}_{1} x} \right] = \alpha_{1}^{\prime } \sum \alpha_{1}^{\prime } = \gamma\) is used. The k-th PC of x is \(\left[ {\alpha^{\prime}_{k} x} \right] = \gamma_{k}\) where \(\gamma_{k}\) is the k-th largest eigenvalue of ∑. \(\alpha^{k}\) is the corresponding eigenvector. The second PC \(\alpha^{\prime}_{2} x\) maximizes \(\alpha^{\prime}_{2} \sum \alpha_{2}\) subject to being uncorrelated \(\alpha^{\prime}_{1} x\) or equivalently subject to \({\text{cov}} \left( {\alpha^{\prime}_{1} x,\alpha^{\prime}_{2} x} \right) = 0\), i.e., \({\text{cov}} \left( {x,y} \right)\) signifies covariance between x and y. Differentiation with respect to \(\alpha_{2}\) gives us \(\sum \alpha_{2} - \gamma \alpha_{2} -\upphi \alpha_{1}^{^{\prime}} \alpha_{1} = 0\) and multiplying the covariance equation with \(\alpha_{1}^{^{\prime}}\) \(\alpha_{1}^{^{\prime}} \sum \alpha_{2} - \gamma \alpha_{1}^{^{\prime}} \alpha_{2} -\upphi \cdot \alpha_{1}^{^{\prime}} \alpha_{1} = 0\) is obtained. So, In case of second PC, \(\sum \alpha_{2} - \gamma \alpha_{2} = 0\) equation is obtained, and with respect to identity matrix, \(\sum \alpha_{2} - \gamma I_{p} = 0\) equation is obtained. Here, \(\alpha_{2}\) is the corresponding eigenvector. The success of PCA is due to following two optional properties (Zou et. al. [64]):
-
a.
Principal components sequentially capture maximum variability in the given dataset X, thus generating a minimum information loss
-
b.
Principal components are uncorrelated so one principal component can be mentioned without referring to others;
2.5 Spatial analysis through IDW method
In order to generalize calculated SPI values in six stations to whole study area, inverse distance weighting (IDW) interpolation is used where power was considered as 2, searching neighborhood was standard with at least 10 neighborhoods. The inverse distance weighting method is an established deterministic method of SPI mapping (Alam et. al. [24], Attore et. al. [65]) which can be applied in little or more spatial information. IDW is one of the frequently used techniques of spatial interpolation and mapping (Lu and Wong [66]).
2.6 Interrelationship assessment, error judgment and significance test of the models
2.6.1 Interrelationship assessment between models using R2 value
Pearson’s correlation coefficient is the popular measure of the linear correlation between two variables. The value lies between − 1 and + 1 where − 1 represents total negative correlation and + 1 represents total positive correlation between variables, whereas 0 represents no correlation.
where xi is the observed value at time i, yi is the simulated value at time i, \(\overline{x}\) is the mean of the observed value, and \(\overline{y}\) is the mean of the simulated value. Positive R2 value indicates direct correlation, and negative value indicates inverse correlation.
2.6.2 Estimation of root mean squared error (RMSE)
Root mean squared error (RMSE) serves to aggregate the magnitude of errors in simulation of various time steps. RMSE has the great adjustability in calculation of accuracy measures in different time steps. Mavromatis [9] used RMSE in assessment of simulation models. The formula of RMSE is as follows
where Ov denotes observed value, Mv denotes the simulated model value, and n is the number of observation.
In this study, Holt–Winter model (HW) and double exponential (DE) models are considered for simulation. So, two RMSEs are obtained:
Thus, combined RMSE can be stated as (from Eq. 16 and Eq. 17):
High RMSE value means there is a larger difference between the actual and simulated drought exists in the particular time step at a particular year or months (Zalenski et. al. [67]). Lower RMSE value indicates higher accuracy.
2.6.3 Anderson–Darling test (AD test)
Anderson–Darling test compares empirical cumulative distribution of sample data series. This test is very effective when the data series becomes sufficiently large. The Anderson–Darling test is the hypothesized distribution is F, and cumulative distribution is Fn and the formula can be written as
2.6.4 Kolmogorov–Smirnov test (KS test)
Kolmogorov–Smirnov test is a nonparametric test of the equality of continuous one-dimensional probability distribution with compare of a sample with reference probability distribution (Kolmogorov [68] and Smirnov [69]). Kolmogorov–Smirnov test statistic can be expressed as (Razali et. al. [70])
where \(I_{{\left[ { - \infty ,x} \right]}} \left( {X_{i} } \right)\) is the indicator function, equal 1 if \(\left( {X_{i} } \right)\) ≤ x and equal to 0 otherwise.
The Kolmogorov–Smirnov statistic of a given cumulative function F(x) is
where sup is the supremum of the set of distance between the \(F_{n} x\) and \(F_{x}\). In our case, this model has been run at 95% significance level.
2.6.5 Ryan joiner test (RJ test)
Ryan Joiner test assesses correlation of a given data series (Yap and Sim [71]). It assesses the strength of the correlation of a given data matrix. If it falls below the critical value, the null hypothesis can be rejected and alternative hypothesis can be accepted. As the correlations of datasets are represented by this test, this procedure is important in judgment of simulation models.
3 Results
Figure 4a–j represents the scatter plots which represent the interrelationship between different model variables. The interrelationships of different model variables show that all the models at every time steps are strongly correlated with above 90% R-squared value. About 96–98% correlation coefficient is observed between double exponential smoothing and observed models. Correlation coefficient value decreases slightly between observed and Holt–Winter exponential smoothing. Significance level of every model is estimated by KS, AD and RJ test. Almost all models are significant at 0.01, 0.05 or 0.005 significance level. Table 5 (in supplementary file) represents the results of significance test.
3.1 Station-wise comparison of 3 months and 6 months SPI
According to observed model, at the 3 months time step, drought (SPI value − 3.125) is at the peak in November 1979 at the station 229,875. At the same time step, according to observed model, extreme to severe (ES) and moderate drought occur at the rate of 8% and 10%, respectively, at the station 233,869. According to observed model, at 6 months time step, peak drought (SPI value − 5.061) is observed in January 1980 at station 233,875. At the same time step, according to observed model, ES and moderate drought occurrence rate (8% for ES and 10% for moderate drought) is highest at 233,869 and 229,875 stations. At, 3 months and 6 months time steps, simulated curves follow observed curves to denote dry and wet conditions (Fig. 5a, b, d, e, g, h). At 3 months time step, SPI simulated by double exponential model is characterized by − 2.387 value (peak intensity) which is observed in January 1993. At 3 months time step, the Holt–Winter model-simulated drought is at its peak in January 1979 at station 233,872. At 6 months time step, double exponential smoothing simulated drought reaches at the peak in April 1980 at the station 233,872. Similarly, Holt–Winter model-simulated drought at the same time step reaches at the lowest level with -2.361 SPI value which is occurred in February 1983 at station 229,872. Drought (category ES) simulated by double exponential and Holt–Winter model reaches at the highest rate of occurrence (about 7% for double exponential and 11% for Holt–Winter) at the station 229,872. Similarly, double exponential and Holt–Winter exponential model simulated moderate drought occurs at the highest rate (e.g., 13% for double exponential and 10% for Holt–Winter) at stations 233,869 and 229,875, respectively. Overall at 3 months and 6 months time steps, drought intensity is at its peak in post monsoon phase. Station-wise drought occurrence rate and drought intensity at different time steps are denoted in Table 4 (in supplementary file).
3-Month, 6-month and 12-month observed and simulated SPI over 1 month lead time of some selected stations (for the efficient understanding, we have mentioned here three stations—station 229,872 and 229,875 have similar nature of drought with station 233,869. 233,872 station has similar characteristics of 233,875)
3.2 Station-wise drought comparison of 12 months, 24 months and 48 months SPI
According to observed model, at 12 months time steps, drought intensity is at its peak with SPI value -5.291 which is observed in December 1979 at station 233,872. At the same time step, according to observed model, ES and moderate droughts are observed in their highest rate of occurrence at the station 229,875 and 233,875. At 24 months time steps, according to observed model, peak drought intensity (SPI − 2.391) is observed in December 1980 at the station 233,872. At 24 months time steps, ES and moderate droughts are noticed with 4% and 14% occurrence rate, respectively, at 233,872 and 233,875 meteorological station. Similarly at 48 months time steps, drought is intensified at the station 233,872 in December 1986 with − 1.513 SPI value. At this time step, ES and moderate drought reach their highest rate of occurrence (e.g., 5–15%) at 233,872, 233,875 and 233,869 station.
The simulated SPI at long-term time frame also follows the observed curves (Fig. 5c, f, i and Fig. 6–f). The SPI (at 12 months time steps) simulated by double exponential model reaches at the lowest level (SPI − 5.667) in July 1980. Holt–Winter model-simulated drought (category ES) at 12 months time steps reaches at the peak (SPI value − 3.771) in April 1980 at the station 233,872. At 24 months time steps, drought simulated by double exponential and Holt–Winter model reaches at the peak with − 2.291 SPI value which is observed in 1984 (please see Table 4 in supplementary section). At 12 months time step, the double exponential model simulated ES and moderate drought reach their highest rate of occurrence (about 13% and 11%, respectively) at the station 233,869. At the same time frame, Holt–Winter model simulated ES and moderate droughts are found in their highest rate of occurrence (about 14%) at 233,869 and 229,872 stations, respectively. At 24 months time frame, double exponential smoothing simulated ES and moderate droughts are identified with 7% and 14% occurrence rate at 233,869 and 229,875 stations, respectively. At 48 months time step, the occurrence rate of extreme to severe drought is negligible. Moderate drought (double exponential model simulated) is observed with 17% occurrence rate at 48 months time step at 229,872 station. Figure 7a–f expresses station-wise drought occurrence rate.
24-month and 48-month observed and simulated SPI over 1 month lead time of some selected stations (for the efficient understanding, we have mentioned here three stations—station 229,872 and 229,875 have similar nature of drought with station 233,869. 233,872 station has similar characteristics of 233,875)
3.3 Spatial assessment
Overall northwestern and western portions of the district face highest intensity of drought. This region also experiences highest occurrence rate of extreme to severe and moderate drought at almost all the time frames. At 3 months and 6 months time frame, south and southwestern portions experience − 2 to − 3 drought intensity value. On the other hand, − 3 to − 5 drought intensity is observed in north and northwestern portions of the study region (Fig. 8). At 12, 24 and 48 months time frame, the peak drought is observed at northwestern part of the study region (SPI value is − 1 to − 3). At 3 months and 6 months time frame, extreme to severe and moderate drought occurrence rate varies from 6 to 3% and 5 to 11%, respectively. This fact is applicable for all models at above mentioned time steps. Southern and southwestern portion of the region experiences 3–4% extreme drought and 5–6% moderate drought at all the time steps. Drought occurrence rate is noticed at a significant level (5–6% extreme to severe, 10–11% moderate drought) in western and northwestern portion of the district at 3 months and 6 months time steps. At 12, 24 and 48 months time frame, extreme drought is observed in northwestern part of the district with 2–4% occurrence rate. Figure 8a–o denotes visualization of drought intensity spatially. Figures 9a–o and 10a–o express extreme to severe and moderate drought occurrence rate spatially. As the time step increases, drought intensity decreases and extreme to severe drought becomes less frequent. On the contrary, as the time step advances, moderate drought becomes more frequent.
3.4 Drought-prone zone identification
Table 3 identifies all variables used in PCA. At 3 months time steps V1, V2, V3, V10, V11,V12, V19, V20 and V21, at 6 months time steps V4, V5, V6, V13, V14, V15, V22, V23 and V24 and at 48 months time step V7, V8, V9, V16, V17, V18, V25, V26 and V27 are considered. Similarly, at 12 months and 24 months time step, V28 to V45 variables are used. For every time step, first four components represent near about 80% of the dataset that is why first four components are considered for determination of PCA. For every time step, northwestern and western portions are demarcated as extreme to severe drought proneness. On the other hand, the eastern edge and southwestern portions are at mild to normal drought-prone condition. At all the time frame, Saltora, Mejhia, Gangajalghati, Chattna, Bankura-I, Bankura-II, Indpur, Onda, Barjora, Sonamukhi and Hirbandh are the highest drought-prone blocks. Patrasayar, Indus, Kotulpur, Jaypur, Bishnupur, Taldangra and Simlipal are the moderate drought-prone blocks. Khatra and Ranibandh are the least drought-affected blocks. The drought-affected region is marked in Fig. 11a–f.
3.5 Assessment of root mean square error (RMSE)
At 3 months and 6 months time step, the highest RMSE is observed in 1998–1999 at 229,869 stations. Station 233,869 and 2338,75 achieve the highest RMSE in 2010–2012 at 3 months and 6 months time steps (Fig. 12a, b, d, e, g, h). 0.1 to 20 RMSE is observed in 12, 24 and 48 months time step. As time step increases, RMSE starts to decrease in a significant proportion. At 12 months time step, the highest error occurs in 2012–2014 at the station 229,869 and station 233,869 (Fig. 12c, f). At this time step, the highest RMSE is observed in 1998–2000 at station 233,875 (Fig. 12i). At 24 months time step, the 229,869 station at 1989–1991 achieved the highest error (Fig. 13a). However, station 233,872 and 233,875 achieved the highest error in 2012–2014 (Fig. 13c, e). Similar nature is also observed in 48 months time frame (Fig. 13b, d, f). High RMSE is observed at northern and northwestern parts at almost all the time frames (Fig. 14a–j). It is interesting to note that extreme to severe drought-prone stations face higher amount of RMSE. In the 3 months time frame, southern and southwestern portions are experiencing total 104–110 RMSE, whereas northern and northwestern portions are experiencing total 111–116 RMSE. At 6 months time frame, southern and southwestern portions are experiencing total 40–44 RMSE, whereas 45–51 total RMSE is observed in northern and northwestern portions. At 12 months, 24 months and 48 months time step, 0.12–14 RMSE is observed at southern and southwestern portion, whereas northern and northwestern portions experience 14–19 RMSE value. With respect to combined error of Holt–Winter and double exponential model, double exponential model produces better result with slightly low (2%–4%) RMSE.
Estimation of root mean square error (RMSE) of some selected stations () at 3 months, 6 months and 12 months time steps (for the efficient understanding, we have mentioned here three stations—station 229,872 and 229,875 have similar nature of drought with station 233,869. 233,872 station has similar characteristics of 233,875)
Estimation of root mean square error (RMSE) of some selected stations () at 24 and 48 months time frame (for the efficient understanding, we have mentioned here three stations—station 229,872 and 229,875 have similar nature of drought with station 233,869. 233,872 station has similar characteristics of 233,875)
4 Discussion and conclusion
In summary, we simulate meteorological drought for Bankura District using double exponential and Holt–Winter exponential model at several time steps. We also compared the efficiency of these two models, which shows that double exponential smoothing model is more accurate one. In our analysis, intensity and frequency of drought act as most important parameters. Intensity and frequency of drought are portrayed using visual interpretative maps and statistical assessment for several time steps. Our study shows that peak intensity of drought decreases with increasing time steps (Fig. 8a–o) and the scenario of drought shifts from extreme to moderate condition (Figs. 5 and 6). RMSE also decreases with increasing time steps (Figs. 12 and 13). Our study reveals that drought scenario is worst for north and northwestern region of Bankura, while the effect of drought is less severe in other regions (southeast, southwest) (Fig. 11a–e). Furthermore, we perform principal component analysis by combining the results from the two models, to delineate drought-prone zones of the district (Fig. 11a–e). Overall the principal component analysis (PCA) reveals that north and northwest portions of the study area are the ES drought-affected zone, while drought is mild at southern portions of the study region.
This study includes several time steps for spatiotemporal simulation of drought, which makes the study interesting and unique. The identification and monitoring of different drought-related parameters at the same time and effective identification of drought-prone zones make this study fruitful one for implementation. This study may also be useful to take actions for improved resiliency of the water management infrastructure, including a more accurate drought forecasting tool for sustainable agricultural production in Bankura District.
References
Chhajer V, Prabhakar S, Prasad, and P. R. C. (2015) Development of index to assess drought conditions using geospatial data a case study of Jaisalmer District, Rajasthan, India. Geoinform Polon 14(1):29–39. https://doi.org/10.1515/gein-2015-0003
Mishra AK, Singh VP (2011) Drought modeling–a review. J Hydrol 403(1–2):157–175. https://doi.org/10.1016/j.jhydrol.2011.03.049
Zarch MAA, Sivakumar B, Sharma A (2015) Droughts in a warming climate: a global assessment of standardized precipitation index (SPI) and reconnaissance drought index (RDI). J Hydrol 526:183–195. https://doi.org/10.1016/j.jhydrol.2014.09.071
Gupta AK, Tyagi P, Sehgal VK (2011) Drought disaster challenges and mitigation in India: strategic appraisal. Curr Sci 100(12):1795–1806
Drumond A, Gimeno L, Nieto R, Trigo RM, Vicente-Serrano SM (2017) Drought episodes in the climatological sinks of the Mediterranean moisture source: The role of moisture transport. Global Planet Change 151:4–14. https://doi.org/10.1016/j.gloplacha.2016.12.004
Olukayode Oladipo E (1985) A comparative performance analysis of three meteorological drought indices. J Climate 5(6):655–664. https://doi.org/10.1002/joc.3370050607
Özger M, Mishra AK, Singh VP (2012) Long lead time drought forecasting using a wavelet and fuzzy logic combination model: a case study in Texas. J Hydro Meteorol 13(1):284–297. https://doi.org/10.1175/JHM-D-10-05007.1
Gupta A, Rawlings JB (2014) Comparison of parameter estimation methods in stochastic chemical kinetic models: examples in systems biology. AIChE J 60(4):1253–1268. https://doi.org/10.1002/aic.14409
Mavromatis T (2007) Drought index evaluation for assessing future wheat production in Greece. Int J Climatol 27(7):911–924. https://doi.org/10.1002/joc.1444
Mishra AK, Sivakumar B, Singh VP (2015) Drought processes, modeling, and mitigation. J Hydrol 526:1–2. https://doi.org/10.1016/j.jhydrol.2015.03.054
Wilhite DA (1996) A Methodology for drought preparedness. Nat. Hazards 13(3):229–252
Mishra AK, Desai VR (2005) Drought forecasting using stochastic models. Stoch Environ Res Risk A 19(5):326–339. https://doi.org/10.1007/s00477-005-0238-4
Zhang G, Ali S, Wang X, Wang G, Pan Z, Zhang J (2019) SPI-based drought simulation and prediction using ARMA-GARCH model. Appl Math Computat 355:96–107. https://doi.org/10.1016/j.amc.2019.02.058
Habibi B, Meddi M, Torfs PJJF, Remaoun M, Van Lanen HAJ (2018) Characterization and prediction of meteorological drought using stochastic models in the semi-arid Chéliff-Zahrez basin (Algeria). Journal of Hydrology: Regional Studies 16:15–31. https://doi.org/10.1016/j.ejrh.2018.02.005
Morid S, Smakhtin V, Bagherzadeh K (2007) Drought forecasting using artificial neural networks and time series of drought indices. Int J Climatol 27(15):2103–2111. https://doi.org/10.1002/joc.1498
Mishra AK, Desai VR (2006) Drought forecasting using feed-forward recursive neural network. Ecol Model 198(1–2):127–138. https://doi.org/10.1016/j.ecolmodel.2006.04.017
Barros AP, Bowden GJ (2008) Toward long-lead operational forecasts of drought: an experimental study in The Murray-Darling River Basin. J Hydrol 357(3–4):349–367. https://doi.org/10.1016/j.jhydrol.2008.05.026
Kalehar PS (2004) Time series forecasting using Holt–Winter exponential smoothing. https://labs.omniti.com/people/jesus/papers/holtwinters.pdf. Accessed 15 Nov 2019
Raha S, Gayen SK (2019) Simulation of meteorological drought of Bankura District, West Bengal: comparative study between exponential smoothing and machine learning procedures. J Geogr Environ Earth Sci Int. https://doi.org/10.9734/jgeesi/2019/v22i130135
Lewis PAW, Ray BK (2002) Nonlinear modeling of periodic threshold auto regressions using TSMARS. J Time Ser Anal 23(4):459–471. https://doi.org/10.1111/1467-9892.t01-1-00269
Hyndman RJ (2004) The interaction between trend and seasonality. Int J Forecast 20(4):561–563
Billah B, Hyndman RJ, Koehler AB (2005) Empirical information criteria for time series forecasting model selection. J Stat Comput Simul 75(10):831–840
Smakhtin VU, Hughes DA (2004) Review, automated estimation and analyses of drought indices in South Asia, vol 83. IWMI
Alam JA, Rahman SM, Saadat AHM (2013) Monitoring meteorological and agricultural drought dynamics in Barind region Bangladesh using standard precipitation index and Markov chain model. Int J Geomat Geosci 3(3):511–523
Haied N, Foufou A, Chaab S, Azlaoui M, Khadri S, Benzahia K, Benzahia I (2017) Drought assessment and monitoring using meteorological indices in a semi-arid region. Energy Procedia 119:518–529. https://doi.org/10.1016/j.egypro.2017.07.064
Hao Z, Singh VP, Xia Y (2018) Seasonal drought prediction: advances, challenges, and future prospects. Rev Geophys 56(1):108–141. https://doi.org/10.1002/2016RG000549
Ghosh KG (2019) Spatial and temporal appraisal of drought jeopardy over the Gangetic West Bengal, eastern India. Geoenviron Disasters. https://doi.org/10.1186/s40677-018-0117-1
Khan JH, Hassan T, Shamsad, (2011) Socio economic causes of rural urban migration in India. Asia-Pac J Soc Sci 10:138–158
Rogaly B (1998) Workers on the move: seasonal migration and changing social relations in rural India. Gender Dev 6(1):21–29. https://doi.org/10.1080/741922628
Banik P, Mandal A, Rahman MS (2002) Markov chain analysis of weekly rainfall data in determining drought-proneness. Discrete Dyn Nat Soc 7(4):231–239. https://doi.org/10.1155/S1026022602000262
Lohar D, Pal B (1995) The effect of irrigation on pre-monsoon season precipitation over south West Bengal India. J Climate 8(10):2567–2570. https://doi.org/10.1175/1520-0442(1995)008%3C2567:TEOIOP%3E2.0.CO;2
Ghosh KG (2016) Long range climatic variability over Birbhum District, west Bengal and their impact on rainfed Aman crop in the context of climate change: adoption and mitigation. In: Chattopadhyay PK, Kushwaha DS (eds) Rural health, women empowerment and agriculture: issues and challenges, chap 21, 1st edn. New Delhi Publishers, New Delhi, pp 277–298
Datta P, Das S (2019) Analysis of long-term precipitation changes in West Bengal, India: AN approach to detect monotonic trends influenced by autocorrelations. Dyn Atmos Oceans. https://doi.org/10.1016/j.dynatmoce.2019.101118
Bhave AG, Mishra A, Groot A (2013) Sub-basin scale characterization of climate change vulnerability, impacts and adaptation in an Indian River basin. Reg Environ Change 13(5):1087–1098. https://doi.org/10.1007/s10113-013-0416-8
Vass KK, Das MK, Srivastava PK, Dey S (2009) Assessing the impact of climate change on inland fisheries in River Ganga and its plains in India. Aquat Ecosyst Health 12(2):138–151. https://doi.org/10.1080/14634980902908746
Ghosh KG (2018) Analysis of rainfall trends and its spatial patterns during the last century over the Gangetic West Bengal, Eastern India. J Geovis Spatial Anal 2(2):15. https://doi.org/10.1007/s41651-018-0022-x
Dile YT, Srinivasan R (2014) Evaluation of CFSR climate data for hydrologic prediction in data-scarce watersheds: an application in The Blue Nile River Basin. J Am Water Resour Assoc 50(5):1226–1241. https://doi.org/10.1111/jawr.12182
McKee TB, Doesken NJ, Kleist J (1993) The relationship of drought frequency and duration to time scales. In: Proceedings of the 8th conference on applied climatology, vol 17, no 22. American Meteorological Society, Boston, MA, pp 179–183
Wu H, Hayes MJ, Weiss A, Hu Q (2001) An evaluation of the standardized precipitation index, the China-Z Index and the Statistical Z-Score. Int J Climatol 21:745–758. https://doi.org/10.1002/joc.658
Durdu ÖF (2010) Application of linear stochastic models for drought forecasting in the Büyük Menderes river basin, western Turkey. Stoch Environ Res Risk Assess 24(8):1145–1162. https://doi.org/10.1007/s00477-010-0366-3
Turkes M, Tatli H (2009) Use of the standardized precipitation index (spi) and a modified SPI for shaping the drought probabilities over turkey. Int J Climatol 29:2270–2282. https://doi.org/10.1002/joc.1862
Lana X, Burgueno A (2000) Statistical distribution and spectral analysis of rainfall anomalies for Barcelona (NE Spain). Theor Appl Climatol 66:211–227. https://doi.org/10.1007/s007040070026
Dogan S, Berktay A, Singh VP (2012) Comparison of multi-monthly rainfall-based drought severity indices, with application to semi-arid Konya closed basin, Turkey. J Hydrol 470–471:255–268. https://doi.org/10.1016/j.jhydrol.2012.09.003
AgriMetSoft (2017) Meteorological drought monitor (version 1) [computer software]. https://agrimetsoft.com/MDM.aspx. Accessed 10 Oct 2019
Salehnia N, Alizadeh A, Sanaeinejad H, Bannayan M, Zarrin A, Hoogenboom G (2017) Estimation of meteorological drought indices based on AgMERRA precipitation data and station-observed precipitation data. J Arid Land 9(6):797–809
Sonmez FK, Komuscu AU, Erkan A, Turgu E (2005) An analysis of spatial and temporal dimension of drought vulnerability in Turkey using the standardized precipitation index. Nat Hazards 35(2):243–264. https://doi.org/10.1007/s11069-004-5704-7
Hyndman RJ, Athanasopoulos G (2018) Forecasting: principles and practice, vol 2. OTexts, pp 1–504
Hussain A, Rahman M, Memon JA (2016) Forecasting electricity consumption in Pakistan: the way forward. Energy Policy 90:73–80
Gundalia MJ, Dholakia MB (2012) Prediction of maximum/minimum temperatures using Holt Winters method with Excel spreadsheet for Junagadh region. Int J Eng Res Technol 1:1–8
Hayes MJ, Svoboda MD, Wardlow BD, Anderson MC, Kogan F (2012) Drought monitoring: historical and current perspectives. University of Nebraska Bulletin, pp 1–19.
Ghosh S, Mujumdar PP (2007) Nonparametric methods for modeling GCM and scenario uncertainty in drought assessment. Water Resour Res 43(7):1–9. https://doi.org/10.1029/2006WR005351
Mujumdar P, Ghosh S, Raje D (2009) Hydro-meteorological predictions from GCM simulations: downscaling techniques and uncertainty modeling. IAHS Publ 333:165
Touma D, Ashfaq M, Nayak MA, Kao S-C, Diffenbaugh NS (2015) A multi-model and multi-index evaluation of drought characteristics in the 21st century. J Hydrol 526:196–207. https://doi.org/10.1016/j.jhydrol.2014.12.011
Chen ST, Kuo CC, Yu PS (2009) Historical trends and variability of meteorological droughts in Taiwan. Hydrol Sci J 54(3):430–441. https://doi.org/10.1623/hysj.54.3.430
Wehner M, Santer B (2011) Projections of future drought in the continental United States and Mexico. J Hydrometeorol 12(6):1359–1377. https://doi.org/10.1175/2011JHM1351.1
Jeong DI, Sushama L, Khaliq MN (2014) The role of temperature in drought projections over North America. Clim Change 127(2):298–303. https://doi.org/10.1007/s10584-014-1248-3
Aryal Y, Zhu J (2017) On bias correction in drought frequency analysis based on climate models. Clim Change 140(3–4):361–374. https://doi.org/10.1007/s10584-016-1862-3
Thompson S (1999) Hydrology for water management. AA Balkema Publ, Rotterdam
Wang QF, Wu JJ, Lei TJ, He B, Wu ZT, Liu M, Mo XY, Geng GP, Li XH, Zhou HK et al (2014) Temporal-spatial characteristics of severe drought events and their impact on agriculture on a global scale. Quat Int 349:10–21. https://doi.org/10.1016/j.quaint.2014.06.021
Cai W, Zhang Y, Chen Q, Yao Y (2015) Spatial patterns and temporal variability of drought in Beijing–Tianjin–Hebei metropolitan areas in China. Adv Meteorol. https://doi.org/10.1155/2015/289471
Dinpashoh Y, Fakheri-Fard A, Moghaddam M, Jahanbakhsh S, Mirnia M (2004) Selection of variables for the purpose of regionalization of Iran's precipitation climate using multivariate methods. J Hydrol 297(1–4):109–123. https://doi.org/10.1016/j.jhydrol.2004.04.009
Demšar U, Harris P, Brunsdon C, Fotheringham AS, McLoone S (2013) Principal component analysis on spatial data: an overview. Ann Assoc Am Geogr 103(1):106–128. https://doi.org/10.1080/00045608.2012.689236
Jolliffe IT, Trendafilov NT, Uddin M (2003) A modified principal component technique based on the Lasso. J Comput Graph Stat 12:531–547. https://doi.org/10.1198/1061860032148
Zou H, Hastie T, Tibshirani R (2006) Sparse principal component analysis. J Comput Graph Stat 15(2):265–286. https://doi.org/10.1198/106186006x113430
Attorre F, Alfo’ M, De Sanctis M, Francesconi F, Bruno F (2007) Comparison of interpolation methods for mapping climatic and bioclimatic variables at regional scale. Int. J. Climatol 27(13):1825–1843. https://doi.org/10.1002/joc.1495
Lu GY, Wong DW (2008) An adaptive inverse-distance weighting spatial interpolation technique. Comput Geosci 34(9):1044–1055. https://doi.org/10.1016/j.cageo.2007.07.010
Zalenski G, Krajewski WF, Quintero F, Restrepo P, Buan S (2017) Analysis of national weather service stage forecast errors. Weather and Forecasting 32(4):1441–1465
Kolmogorov A (1933) Sulla Determinazione Empirica Di Una Legge Di Distribuzione. G Ist Ital Attuari 4:83–91
Smirnov N (1948) Table for estimating the goodness of fit of empirical distributions. Ann Math Stat 19:279–281. https://doi.org/10.1214/aoms/1177730256
Razali NM, Shamsudin NR, Maarof NNNA, Ismail A (2012). A comparison of normality tests using SPSS, SAS and MINITAB: an application to health related quality of life data. In: 2012 international conference on statistics in science, business and engineering (ICSSBE). IEEE, pp 1–6
Yap BW, Sim CH (2011) Comparisons of various types of normality tests. J Stat Comput Simul 81(12):2141–2155
Acknowledgements
The authors are thankful to editor and three anonymous reviewers for their valuable comments to improve the quality of manuscript time to time.
Author information
Authors and Affiliations
Contributions
Raha conceptualized the issue, and he has written the whole article. Gayen S.K. has developed methodological framework and managed table and figures.
Corresponding author
Ethics declarations
Conflict of interest
There is no conflict of interest regarding publishing of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Raha, S., Gayen, S.K. Simulation of meteorological drought using exponential smoothing models: a study on Bankura District, West Bengal, India. SN Appl. Sci. 2, 909 (2020). https://doi.org/10.1007/s42452-020-2730-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42452-020-2730-3