Evaluation of heterogeneity statistics as reasonable proxies of the error of precipitation quantile estimation in the Minneapolis-St. Paul region

doi:10.1016/j.jhydrol.2014.03.056

Journal of Hydrology

Volume 513, 26 May 2014, Pages 457-466

https://doi.org/10.1016/j.jhydrol.2014.03.056 Get rights and content

Summary

Estimating precipitation frequency is important in engineering, agriculture, land use planning, and many other disciplines. The index flood method alleviates small sample size issues due to short record length by calculating normalized quantile estimates for averaged data from a “region” of gauges. For a perfectly homogeneous region this adds no error; heterogeneity statistics seek to quantify a real-world region’s deviation from this assumption. Hosking and Wallis (1997) introduced a Monte Carlo heterogeneity statistic called here $H_{1}$ and used a simulation study to assess its utility while rejecting two similar statistics called here $H_{2}$ and $H_{3}$ . A nearly linear relationship was found between $H_{1}$ and the percentage root mean square error (RMSE) increase due to heterogeneity, establishing $H_{1}$ as a “reasonable proxy” of quantile error. The $H_{1}$ -percent RMSE added relationship found in the simulation experiment was used to find equivalent RMSEs for heterogeneity thresholds against which all three H statistics were tested. In this study the “reasonable proxy” relationship is evaluated across a highly skewed daily precipitation dataset in Minnesota for $H_{1}, H_{2}$ and $H_{3}$ . Simulated regions used in quantile error estimation are generated using at-site L-moment ratios scaled toward the regional mean with a shrinkage multiplier. A linear relationship is found between Monte Carlo estimates of quantile RMSE and both $H_{1}$ and $H_{2}$ across all possible regionalizations of twelve gauges. $H_{2}$ ’s relationship is less linear than $H_{1}$ ’s as quantified by Pearson’s r. A synthetic study is also undertaken using the same sample sizes, regional L-moment averages, and between-site variations as the Hosking and Wallis (1997) simulation. The $H_{2}$ -percent RMSE added relationship is found to be nearly as linear as for $H_{1}$ , complementing the enumeration study’s findings. Because $H_{2}$ ’s linear relationship with percent RMSE added has approximately one-fourth the slope of the $H_{1}$ -RMSE relationship, heterogeneity thresholds calculated with reference to $H_{1}$ should not be applied to $H_{2}$ . $H_{2}$ thresholds can be derived from the $H_{2}$ -percent RMSE added relationship in analogous fashion to the method used in Hosking and Wallis (1997) for $H_{1}$ . The resulting thresholds are one-fourth the magnitude of the $H_{1}$ thresholds.

Highlights

•
Heterogeneity was tested as a proxy of quantile error for daily rainfall totals.
•
Monte Carlo estimates were calculated for all possible regions of a gauge network.
•
Two heterogeneity statistics were found to be reasonable proxies of error.
•
Previous findings held only one of these to be a reasonable proxy.

Introduction

Rare or extreme precipitation events, which include events classified as natural disasters, have major ecological, economic, and public safety significance. Sample size is often a limiting factor in the estimation of extreme hydrological events; one rule of thumb for flood frequency estimation is that for reliable estimates of a return period T the record length in station-years must exceed $5 T$ (Robson and Reed, 1999). Many statistical hydrologists have followed the index flood method of Dalrymple (1960), in which sample size is increased by grouping gauges, or“sites”, into regions, calculating a regional “growth curve” normalized by an index such as the mean or median of the at-gauge data, and estimating at-site quantiles by multiplying the index flood and the regional growth curve.

Linear moments, analogues of conventional central moments like skewness and kurtosis based on probability weighted moments (Greenwood et al., 1979), are often used in this context. L-moment estimators have lower bias than other common methods of estimation at small sample size (Hosking et al., 1985, Lettenmaier et al., 1987). They are less biased than conventional moment estimators, are not bounded by sample size, and are more robust to outliers. L-moment ratios can be more reliably predicted from a subsample than conventional moment ratios. L-moment ratios, the second through fourth of which are denoted the coefficient of L-variance (L-CV), L-skewness, and L-kurtosis (the first L-moment ratio does not exist), provide greater insight into the underlying distribution of high-skew data than conventional moment ratios. For example, L-moment ratio diagrams are used as decision aides for identifying the underlying distribution of regional data (Hosking, 1990, Vogel and Fennessey, 1993, Hosking and Wallis, 1997, Zafirakou-Koulouris et al., 1998).

L-moment analysis of regions formed according to hydrological characteristics has been conducted in recent decades on streamflow (Vogel et al., 1993, Ouarda et al., 2008, Noto and Loggia, 2009) and precipitation data (Guttman et al., 1993, Werick et al., 1994, Adamowski et al., 1996, Alila, 1999, Smithers and Schulze, 2001, Kyselý et al., 2007, Modarres and Sarhadi, 2011). L-moment ratios for daily data series using “wet-day” (non-zero only) and full datasets have been evaluated across the United States (Hanson and Vogel, 2008). Regional frequency analysis models using fuzzy regions (Jingyi and Hall, 2004, Rao and Srinivas, 2006) and fractional-membership regions of influence (Burn, 1990, Zrinji and Burn, 1994, Gaál et al., 2008) represent alternatives to the strict regional membership model.

The regional pooling mechanism of the index flood method involves the assumption of homogeneity across the sites in a candidate region - at-site differences in L-moment ratios are assumed to be due solely to sampling variability. The degree to which the homogeneity assumption is violated is therefore likely to be related to quantile error. Statistics quantifying the heterogeneity of a region based on Monte Carlo simulation have been proposed which sample from a Generalized extreme-value distribution (Lu and Stedinger, 1992, Alila, 1999).

Hosking and Wallis (1997) define three statistics based on the between-site variation of L-moment ratios, $H_{1}, H_{2}$ , and $H_{3}$ . All three H statistics fit the flexible four-parameter Kappa distribution with the average L-moment ratios of the region in question and use Monte Carlo simulation to generate simulated regions from the Kappa. For the real region and for each simulated region a statistic called $V_{1}, V_{2}$ , or $V_{3}$ is calculated using the sum of the squared difference between each site’s L-moment ratio values and the regional average. $V_{1}$ uses only the L-CV, $V_{2}$ incorporates L-CV and L-skewness, and $V_{3}$ incorporates L-skewness and L-kurtosis. $H_{1}$ is calculated when $V_{1}$ for the real region minus the mean of $V_{1}$ for simulated regions is divided by the standard deviation of simulated regions’ $V_{1}; H_{2}$ and $H_{3}$ are calculated analogously (see Eqs. (6), (7), (8), (9)).

A simulation study is used in Hosking and Wallis (1997) to reject $H_{2}$ and $H_{3}$ and to accept $H_{1}$ . Heterogeneous regions’ RMSEs are divided by their equivalent homogeneous region’s RMSE. This isolates the RMSE increase due to heterogeneity. $H_{1}$ is shown to have a linear relationship with percent RMSE added due to heterogeneity. Results for $H_{2}$ and $H_{3}$ are not reported.

Hosking and Wallis (1997) define thresholds below which regions can be considered “possibly” and “definitely” heterogeneous with reference to a range of percent RMSE added values implied by the $H_{1}$ -percent RMSE added relationship. $H_{1} = 1$ is found to indicate a 20–40% increase in RMSE, while $H_{1} = 2$ is associated with 40–80% increases. These thresholds are also applied to $H_{2}$ and $H_{3}$ , which are found to rarely exceed them.

Viglione et al. (2007) investigate $H_{1}$ and $H_{2}$ as well as two nonparametric heterogeneity statistics by measuring the fraction of simulated regions that are correctly and incorrectly identified as heterogeneous. The threshold of $H = 2$ is used for both $H_{1}$ and $H_{2}$ . They confirm the utility of $H_{1}$ for simulated data with L-skewness below 0.23 and reject $H_{2}$ . The bootstrap Anderson–Darling test is found to be more powerful than either statistic for data with higher skewness.

Two approaches are used in this study to quantify the power of $H_{1}, H_{2}$ , and $H_{3}$ as proxies of error due to heterogeneity. The original Hosking and Wallis (1997) simulation study is recapitulated and results for $H_{2}$ and $H_{3}$ are presented alongside those for $H_{1}$ . Thresholds for $H_{2}$ are found using its linear relationship to quantile error, not $H_{1}$ ’s. An enumeration study is also conducted, estimating $H_{1}, H_{2}, H_{3}$ , and quantile error for all possible regionalizations of a small daily precipitation gauge dataset. Components of error unrelated to heterogeneity are preserved in this study, allowing the heterogeneity statistics’ relationships with total estimated quantile error to be compared to the ideal case represented in the simulation experiment.

The remainder of this paper is structured as follows. Daily precipitation gauge data from which all possible regions are to be enumerated are presented in the following section, Section 2. Section 3 introduces the equations and methods used to calculate linear moments of the data, estimate regional heterogeneity, assign a regional distribution, estimate the RMSE of regional quantile estimates, and perform a simulation experiment analogous to that presented in Hosking and Wallis (1997). Section 4 presents the results first of the simulation experiment, then of the enumeration experiment, which compares heterogeneity and error estimates across all possible regions formed from the selected precipitation gauges. Section 5 discusses the results; Section 6 summarizes the paper and offers conclusions and potential avenues of future research.

Section snippets

Data

Mean annual precipitation in Minnesota ranges from the low teens to above 30 in., with the mean annual precipitation generally increasing from the northwest to the southeast. Moist air carried from the Gulf of Mexico is an important source of precipitation in Minnesota. Almost half of yearly precipitation occurs in June, July, and August (Baker et al., 1967). Rainfall quantiles have been estimated for Minnesota using the Generalized extreme-value distribution on the annual maximum series (

Linear moments

The Hosking (1990) approach begins from the probability weighted moments (PWM) of Greenwood et al. (1979), defined in Eq. (1): $β_{r} = E {X {[F (X)]}^{r}}$ where $F (X)$ is the cumulative distribution function (cdf) of $X, X [F]$ is the inverse cdf or quantile function of X for probability F, and $β_{r}$ is the rth-order PWM ( $β_{0}$ is equal to the mean $μ = E [X]$ ). Hosking (1990) defines the L-moments $λ_{r + 1}$ in Eq. (2): $λ_{r + 1} = \sum_{k = 0}^{r} p_{r, k}^{*} β_{k}$ with $p_{r, k}^{*}$ calculated according to Eq. (3): $p_{r, k}^{*} = \frac{{(- 1)}^{r - k} (r + k)!}{{(k!)}^{2} (r - k)!}$

$λ_{1}$ is the mean of a

Results of simulation experiment

The simulation experiment described in Section 3.5 outputted similar results to those described in Table 4.1 of Hosking and Wallis (1997) for $H_{1}$ . Averages of the H statistics are taken across 100 simulations for each set of L-moment ratios. In Fig. 3, depicting the linear relationship between each H statistic and percent RMSE added due to heterogeneity for a non-exceedance frequency of 0.01, results for $H_{1}$ are similar to Fig. 4.2 in Hosking and Wallis (1997). Results for $H_{2}$ and $H_{3}$ are not

Discussion

In the simulation experiment the performance gap between $H_{2}$ and $H_{1}$ is relatively narrow, indicating that L-skewness offers a useful amount of heterogeneity information in the presence of L-CV variation. $H_{2}$ is consistently a slightly less faithful proxy for error than $H_{1}$ across the simulated regions, but like $H_{1}$ it can also be plotted against percent RMSE added due to heterogeneity and threshold values can be described as equivalent to a range of percent RMSE added. Analogously to the process

Summary and conclusions

Hosking and Wallis (1997) find that $H_{1}$ has power as a proxy of error while $H_{2}$ and $H_{3}$ do not, but simulation and enumeration studies conducted here paint a more nuanced picture. $H_{1}$ remains the favored heterogeneity statistic across simulated and real-world datasets across a wide range of skewness, but $H_{2}$ is nearly as effective. The efficacy of $H_{2}$ has been obscured through the application of thresholds constructed with reference to the linear relationship between $H_{1}$ and percent RMSE added, which

Acknowledgements

Data were provided by the State Climatology Office, Minnesota Department of Natural Resources–Division of Ecological and Water Resources. This work also used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation Grant No. OCI-1053575. Financial support provided by the George Mason University Presidential Scholarship is gratefully acknowledged. The authors also wish to thank Jason Giovannettone for his assistance.

References (34)

K. Adamowski et al.
Regional rainfall distribution for Canada
Atmospheric Research
(1996)
K.A. Blumenfeld et al.
Using a high-density rain gauge network to estimate extreme rainfall frequencies in Minnesota
Appl. Geograp.
(2011)
Z. Jingyi et al.
Regional flood frequency analysis for the Gan-Ming River basin in China
J. Hydrol.
(2004)
L.H. Lu et al.
Sampling variance of normalized GEV/PWM quantile estimators and a regional homogeneity test
J. Hydrol.
(1992)
R. Modarres et al.
Statistically-based regionalization of rainfall climates of Iran
Global Planet. Change
(2011)
T.B.M.J. Ouarda et al.
Intercomparison of regional flood frequency estimation methods at ungauged sites for a Mexican case study
J. Hydrol.
(2008)
A.R. Rao et al.
Regionalization of watersheds by fuzzy cluster analysis
J. Hydrol.
(2006)
J.C. Smithers et al.
A methodology for the estimation of short duration design storms in South Africa using a regional approach based on L-moments
J. Hydrol.
(2001)
Z. Zrinji et al.
Flood frequency analysis for ungauged sites using a region of influence approach
J. Hydrol.
(1994)
Y. Alila
A hierarchical aproach for the regionalization of precipitation annual maxima in Canada
Journal of Geophysical Research
(1999)

D.G. Baker et al.

Climate of Minnesota: Part V: Precipitation Facts, Normals and Extremes

(1967)

D.H. Burn

Evaluation of regional flood frequency analysis with a region of influence approach

Water Resour. Res.

(1990)

Dalrymple, T., 1960. Flood Frequency Analyses: Manual of Hydrology: Part 3. Flood-Flow Techniques. Water-Supply Paper...

L. Gaál et al.

Region-of-influence approach to a frequency analysis of heavy precipitation in Slovakia

Hydrol. Earth Syst. Sci.

(2008)

J.A. Greenwood et al.

Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form

Water Resour. Res.

(1979)

N.B. Guttman et al.

Regional precipitation quantile values for the continental United States computed from L-moments

J. Clim.

(1993)

Hanson, L.S., Vogel, R., 2008. The probability distribution of daily rainfall in the United States. In: World...

Cited by (0)

View full text

Published by Elsevier B.V.

Evaluation of heterogeneity statistics as reasonable proxies of the error of precipitation quantile estimation in the Minneapolis-St. Paul region

Summary

Highlights

Introduction

Section snippets

Data

Linear moments

Results of simulation experiment

Discussion

Summary and conclusions

Acknowledgements

Atmospheric Research

Appl. Geograp.

J. Hydrol.

J. Hydrol.

Global Planet. Change

J. Hydrol.

J. Hydrol.

J. Hydrol.

J. Hydrol.

A hierarchical aproach for the regionalization of precipitation annual maxima in Canada

Journal of Geophysical Research

Climate of Minnesota: Part V: Precipitation Facts, Normals and Extremes

Evaluation of regional flood frequency analysis with a region of influence approach

Water Resour. Res.

Region-of-influence approach to a frequency analysis of heavy precipitation in Slovakia

Hydrol. Earth Syst. Sci.

Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form

Water Resour. Res.

Regional precipitation quantile values for the continental United States computed from L-moments

J. Clim.