1 Introduction

Floods rank among the most wide-reaching and commonly occurring natural hazards in Europe. In the International Disaster Database EM-DAT (Guha-Sapir et al. 2015), flood events account for 36 % of the damages recorded from natural disasters in Europe, followed by storm events (27 %) and earthquakes (21 %). The analysis in this paper is based on data obtained from Munich Re, NatCatSERVICE, 2014 though, which we would like to gratefully acknowledge here. Losses from floods show an increasing trend, which is mostly attributable to socio-economic factors, including population growth, economic development and construction activities in vulnerable areas. However, also climate change is expected to intensify the impacts of flooding (IPCC 2014 Field and Van Aalst 2014). Hence, whilst representing a major issue already today, managing the risk of flooding is expected to become an even more important topic in the future.

Efficient flood risk management requires a combination of risk reduction, risk retention and risk transfer. The latter is defined as shifting the burden of disaster loss to another party (for instance, by means of insurance). It represents an important instrument for managing the risk resulting from natural perils such as floods and can help in mitigating or minimizing disaster losses. A well-implemented plan how to spread economic risks from extreme events within society and/or transfer them from the victims to the financial markets is a fundamental adaptation measure that crucially decides on how impacts from climate change will finally disturb a society. Adequately designed, risk transfer mechanisms even have the potential to generate incentives for individuals as well as the collective to actively engage in risk reduction.

With respect to flood events (and natural catastrophes in general), a broad range of national risk transfer systems can be found in EU member states (see, for example, CEA 2011; IBC 2015; Jongman et al. 2014; Keskitalo et al. 2014; Maccaferri et al. 2012; Seifert et al. 2013; Schwarze and Wagner 2009; Psenner et al. 2012). These vary considerably regarding their organizational structure and design elements. However, what most of these systems have in common is some kind of state intervention. The focus of the present paper is twofold: On the one hand, it tackles the question what the role of the state (or a supranational body such as the EU) can be in the transfer of risk from floods, given that in many member states, there is not sufficient private insurance coverage for flood events. On the other hand, the paper tries to contribute to the flood risk modelling literature. Why are we combining these two quite separate issues? Firstly, the chosen risk modelling technique directly raises the question at what level a public intervention, such as acting as insurer of last resort, would make economic sense. Secondly, we try to argue that if there is a role for public authorities to intervene in the risk transfer process due to market failure, then the topic of better availability of damage data supporting better risk modelling that takes place in the public domain becomes a crucial issue as well. There are several proprietary risk modelling packages around, the validity of which cannot be publicly debated since the model assumptions are not open to the scientific community. This might be considered a problem, if public funds are supposed to play a role in risk transfer mechanisms.

The remainder of this paper is structured as follows: Sect. 2 describes the key challenges of flood insurance and current approaches of dealing with them. Correlated risks of flooding are one such troublemaker, often leading to a breakdown of national insurance supply. However, correlation patterns also can be a positive contribution to handle risks, if we move to international cooperation in risk transfer. It is only in Sect. 4, where we elaborate this thought in more detail as we need to have a concrete look into European flood risk on the national level first. This is carried out in Sect. 3, where we use empirical flood loss data across Europe to calibrate a model and then determine loss quantiles as required for flood risk management. After a short overview of other modelling approaches and describing EVT techniques in the context of flood risk modelling, we apply them to a suitably normalized data set of European flood loss data. A quantitative assessment of diversification potential for flood risk across European countries is given. As mentioned, Sect. 4 then discusses possibilities for the formation of sub-European pooling initiatives, in case a general collaboration on the European level is not feasible. Note that the quantitative model used in this paper does not take into account possibly changing flood risk due to climate change. However, the suggested changes in the flood risk transfer mechanism that are substantiated by the quantitative modelling approach followed in this paper can be seen as a possible major climate adaptation measure to increase the risk resilience of Europe, irrespective of whether the risk per se increases or not. Section 5 discusses the results and concludes.

2 Key challenges in flood insurance and strategies to handle them

From a supply-side point of view, particular conditions need to be fulfilled in order to be capable of maintaining the provision of a working insurance system that is able to transfer, share and reduce risks (see, for example, Kunreuther and Freeman 1996; Kunreuther 2008; Prettenthaler and Albrecher 2009). Preconditions for the insurability of an event usually include the existence of a huge number of similar insurance entities, the determinability, measurability and randomness of the resulting damages—including that the occurrence and severity of the damage is beyond the control of the insured—as well as the calculability of the damage probability. Moreover, damages experienced by the insured entities should occur independently from one another, i.e. they should not hold any catastrophic damage potential. Lastly, premiums are to remain affordable.

Floods—like most other types of natural perils—however, show various characteristics that make the sustainable provision of a working insurance system at affordable price challenging.

2.1 Small risk collective and adverse selection

One challenge relates to the creation of a risk collective of sufficient size (see, for example, Prettenthaler and Albrecher 2009). Floods tend to occur at the same place over and over again. If insuring against floods is voluntary, coverage tends to be demanded mainly in those areas that show an excessive damage probability. The overrepresentation of ‘bad risks’, i.e. with high loss probability, in the collective that demands insurance especially becomes a problem, if insurance companies have difficulties in screening clients (i.e. in case of information asymmetries e.g. due to lacking hazard maps) and/or are not able or allowed to charge risk-based premiums. This situation, where an individual’s demand for insurance is positively correlated with its risk of loss and the insurer is unable to account for this additional risk in the price of insurance (also known as adverse selection problem), causes a vicious circle of increasing premiums and decreasing insurance demand of less endangered potential policyholders. Hence, adverse selection leads to a small risk collective and can threaten the economic viability of an insurance system. Common strategies to counteract the problems of too small risk collectives and adverse selection include the bundling of flood insurance with other kinds of (preferably) uncorrelated perils (e.g. fire or earthquakes) and risk-based premiums (see, for example, Prettenthaler and Albrecher 2009; Botzen 2013). The latter also ranks among the measures against moral hazard, a problem described below. However, it may result in unaffordable premiums in high risk zones.

2.2 Moral hazard

Besides adverse selection, there is a second consequence that may arise from information asymmetries between insurance companies and policyholders and reduce insurance demand: the problem of moral hazard (see, for example, Prettenthaler and Albrecher 2009; Surminski 2014). It may occur after taking out an insurance policy and refers to a change in the insured’s behaviour that causes the probability of loss to be higher than considered when setting up the contract. Examples for changes in the behaviour of the policyholder include reduced efforts of avoiding damages or of keeping them at a minimum. The problem of moral hazard—which is not exclusively related to flood risk but rather represents a key challenge for any insurance product—leads to a costly cycle of losses and hence makes it difficult to maintain the provision of insurance. Strategies in place to reduce the problem of moral hazard include the—however, somewhat costly—monitoring of insured (see, for example, Botzen 2013) and the introduction of deductibles, co-insurance or upper limits on coverage. The effectiveness of the latter tools, however, remains unclear (Surminski 2014).

2.3 Charity hazard

The demand for insurance coverage may also be negatively affected by a lack of risk awareness (if, for example, information on the exposure is not sufficiently available) or by the so-called charity hazard. The latter refers to the tendency of an individual to forego purchasing insurance or taking other precautions provided that ex post governmental assistance or aid from other sources can be anticipated in the event of a disaster (see, for example, Lewis and Nickerson 1989; Browne and Hoyt 2000; Prettenthaler and Albrecher 2009).

2.4 Correlated risks

Besides difficulties in creating a sufficiently large risk collective, there are further factors making the provision of flood insurance at an affordable price challenging: It is, for example, difficult to estimate uncertain low-frequency high-impact risks and, hence, the respective insurance premiums (see, for example, Paudel 2012). Surminski (2014, p. 260) mentioned in this context that ‘flood is often regarded as the most technically challenging type of insurance due to a lack of accurate assessment of exposure, difficulty in estimating the probability of occurrence of an event and potential losses faced’. Another challenge regarding the coverage of flood risk results from the possibility of a catastrophic damage. As natural hazards typically affect large connected areas, the resulting damages are correlated. Hence, large amounts of capital have to be available all at once in order to prevent insolvency on the one hand and to be able to guarantee coverage of the insured damages even in case of damage peaks on the other hand. Precautions such as the introduction of various limits, the development of insurance pools, reinsurance or the involvement of the international capital markets aim at limiting the damage burden for the single insurance company and at ensuring the required capacities (see, for example, Prettenthaler and Albrecher 2009). In the sections that follow, we will show how international cooperation in risk transfer may also help to reduce the capital needed to prevent insolvency.

2.5 State intervention

Despite the described possibilities of coping with the mentioned challenges, it might be difficult or even impossible for the private insurance sector to efficiently provide comprehensive insurance coverage for the whole population on its own. Small risk collectives, a lack of risk awareness among the population, adverse selection and moral hazard as well as the risk of loss accumulation contribute to the facts, (i) that insurance penetration related to flood risk is often very low, (ii) that coverage is strongly limited in many cases or only available at high costs and (iii) that only a small part of the losses resulting from a catastrophic event is covered by insurance. Due to these difficulties, different kinds and extents of state intervention have developed (see, for example, Paudel 2012), which include among others:

  • Providing the necessary framework for private insurance companies to cover flood risk (e.g. through the provision of nationwide hazard maps, risk-minimizing spatial planning and building regulations, legal framework for obligatory coverage extension or compulsory insurance)

  • Subsidizing insurance premiums.

  • Acting as (re)insurer of last resort.

  • Providing ex post ad hoc aid (which, however, may lead to negative incentives such as charity hazard).

  • Managing the insurance scheme.

  • Acting as monopolistic insurance provider.

The adequate role of the state in a nation’s risk transfer system is frequently discussed (see, for example, Paudel 2012) and a variety of systems has evolved in the past, ranging from a rather passive role of the state via the establishment of an adequate framework for the insurance industry through to an active role in compensating private damages (for comprehensive overviews see, for example, CEA 2011; IBC 2015; Jongman et al. 2014; Keskitalo et al. 2014; Maccaferri et al. 2012; Seifert et al. 2013; Schwarze and Wagner 2009 and Psenner et al. 2012). When taking an active role in compensating private damages, acting, for instance, as an insurer of last resort is seen as principally preferable to governmental ex post ad hoc aid (e.g. through compensation funds), since the latter is related to negative incentive effects. The (hypothetic) Solvency II compliant capital requirement for the state as an insurer of last resort is not necessarily much smaller than the Solvency II compliant capital requirement for a primary insurance pool, though: Prettenthaler and Albrecher (2009), for example, calculated both numbers for a public–private partnership (PPP) suggested for Austria, where a national pool was supposed to cover damages up to 3 bn € , whilst the government was responsible to cover the damage beyond that threshold, and the capital requirement proved to be about the same for both.

Thus, the state, even though it is the biggest possible risk collective, still may be overcharged. This does not come as a surprise, since the high (spatial) correlation of flood risk leads to high aggregate risk for any fund whose portfolio is not spatially diversified. But whilst the typical European nation states are composed of only a few river basins (if they belong to more than one at all), the European Union is composed of a big enough number of river basins, rainfall patterns and climatic zones. It only was logic thus that the big flood events of 2002 led to the establishment of the European Union Solidarity Fund (EUSF). ‘The EUSF supplements countries’ own public expenditure to finance essential emergency operations. These include: restoring essential infrastructure e.g. energy, water, health and education; temporary accommodation and costs of emergency services to meet immediate needs; securing of prevention infrastructures, such as dams; measures to protect cultural heritage; clean-up operations. Damage to private property or income loss, considered insurable, is not covered’ (EC Regulation No 2012/2002 and EU Regulation No 661/2014). In that way, the EU has also somehow become an insurer of last resort for the national public expenditures associated with floods. Clearly this engagement does not include the expenditures the member states incur in their own role as insurers of damage to private property. But at least there is a justification for extending the discussion on the role of the state from the level of individual member states to that of the EU as a supranational organization that especially can make use of the diversity of its member states in the sense of risk diversification, if any role is adopted. To make it clear: since the state has to intervene anyhow, providing primary insurance or reinsurance on its own is not a completely awkward option. But it is rather costly for an individual state. However, if a state cooperates with other states, having a different flood risk portfolio can reduce the cost of providing this type of insurance. One would expect that the flood risk across European nations is sufficiently diverse such that there is a strong economic incentive for cooperation.

Of course, it is very unlikely that the EU member states would easily agree upon a specific role, the Union could play in (re)insuring overall national flood risks, given the diversity of approaches being followed in this respect on the national level: Some countries (e.g. Germany, Italy, UK and—just recently—Finland) show (purely) market-based systems, sometimes systematically coupled with state-funded ad hoc relief. Others (e.g. Spain and France) exhibit public or quasi-public monopoly insurance provision. A third group of countries (e.g. Austria, Denmark, Belgium and non-EU member Norway) manage flood risk transfer either mainly via tax-financed public disaster funds or by means of a combination between public disaster fund and private insurance provision. Due to this diversity—evolved in the light of diverse historical and cultural backgrounds—any harmonization project that seeks to prescribe one solution for all European member states seems a very demanding task, whose usefulness is to be questioned critically. Even discussions within nations on reforming the national systems usually take very long as the examples of Romania, the UK, Austria, Germany or the Netherlands show (see, for example, Prettenthaler and Albrecher 2009; Schwarze and Wagner 2009; Seifert et al. 2013; Surminski 2014). This is why the reform option put forward in this paper is of a completely different nature. Instead of harmonizing national legal frameworks, it focuses on exploiting the flood risk diversification potential available within the European Union. Once the diversification potential is known, it may well be that bilateral agreements of risk pooling evolve, not necessitating a unanimous stance towards this issue shared by all member states. It is this diversification potential that is illustrated in the following sections and to the calculation of which we now turn.

3 Modelling flood risk with EVT techniques

3.1 Other approaches

Since our modelling method solely focusses on extreme value theory (EVT), it might be a good idea to also give a short overview on other approaches first: quantitative flood loss modelling has received quite some attention in the recent literature (although due to the complexity of the subject, flood models are recognized to leave considerable uncertainty, see Merz et al. 2004). Flood losses can be divided into direct and indirect damages, each of which may be further divided into tangible and intangible damages (Merz et al. 2010a). The literature on loss modelling has mostly focused on tangible direct damages, since they are easier to quantify than the other kinds of damages. The relativeFootnote 1 or absolute monetary damage is typically explained by characteristics (factors) of the flood, such as inundation depth, flow velocity, duration of inundation, time of the flood (day/night and season), contamination and flood warning (Merz et al. 2010a). In this type of analysis, the exposed assets are commonly divided into homogeneous groups (e.g. type of buildings), and for each group, the relative (or absolute) damage is described as a function of the considered factors. The main factor is usually inundation depth, leading to the so-called depth-damage function, which is derived for a region or a country. For a review of available models, see Merz et al. 2010b. The following flood models are, for instance, in this vein:

  • FLEMOps (Thieken et al. 2008) for the private sector in Germany. The relative loss is explained by water depth, contamination, building type and quality of the building and is based on the loss data obtained between 2002 and 2006 in the Elbe and Danube catchments in Germany.

  • FLEMOcs (Kreibich et al. 2010) for industrial sector in Germany. The relative loss is a function of water depth, contamination, business sector, number of employees and precaution.

  • Multi-coloured manual (Penning-Rowsell et al. 2005) for the UK, where the absolute loss is explained by water depth, flood duration, building type and age.

  • Damage scanner (see, Jongman et al. 2012 for a review) are the standard methods to estimate the flood economic loss based on water depth in the Netherlands.

  • A depth-damage function for the special case of dam breach scenarios was provided by Prettenthaler et al. (2010).

  • The JRC model (Huizinga 2007), which is a pan-European damage model describing absolute monetary loss as a function of water depth. This model is applied by Feyen et al. (2012) to evaluate the impact of climate change on fluvial flood risk in Europe.

Jongman et al. (2012) applied seven damage estimation models (four of them from the above models) to estimate flood losses of two specific regions and compared estimated to observed values. Results were, however, rather unsatisfactory, which indicates that flood risk modelling efforts need further attention, see also Merz et al. (2010b).

Due to their ‘black-box character’—in the sense that they are not publicly available—the approaches used by commercial risk modelling companies (e.g. AIR, RMS, JBA Risk Management, Ambiental Technical Solutions Ltd.) can unfortunately not be discussed in detail in this paper. However, the field seems to be quite dynamic, with regular releases of new or updated models. Lately, commercial flood risk model providers have also emphasized the importance for (European) insurers to look across national borders and think about the spatial correlations of flood risk across Europe (see, for example, Savina 2016). Whereas RMS, for instance, initially only offered flood risk models for individual countries (including the UK, Belgium and Germany), they released a pan-Europe flood model encompassing 13 countries in 2015.

The Solvency II Directive of the European Commission within the new regulatory framework for the European insurance industry prescribes capital requirements for insurance companies according to a Value-at-Risk (VaR) at the 99.5 % level, i.e. insurance companies are required to hold sufficient capital to remain solvent with a probability of 99.5 %. However, linked to the implementation of this rule is the non-trivial task of estimating such extreme quantiles. This is particularly challenging since most often the empirical data available for these estimations are very limited in scope. Clearly, one needs to employ some extrapolation techniques, as in most cases the time interval of available observations does not include such an extreme event. Extreme value theory (EVT) is a natural choice towards that end, see, for example, Beirlant et al. (2004). It provides a tool-kit to model the distribution of extreme events by using patterns of the largest observations. In particular, it is a sensible way to describe tails of a distribution based on smaller observations and hence provides a reasonable way to extrapolate data sets beyond their range, something that is needed to calculate the VaRs. In this paper, we will apply EVT methodology to model the tail of the distribution of flood risks, for individual European countries and for Europe as a whole. In particular, in view of the Solvency II guidelines, we aim to establish estimates on the VaR at the 99.5 % level on the basis of historical loss data, considering also the change in the building stock (value) over the years. Whereas in insurance practice, flood risk may eventually be pooled with other risks, and the VaR reported to the regulator will include a number of further factors such as the assets of the company, loss reserves, the specification of the stand-alone VaR figure is a natural indicator for the underlying flood-specific risks of each country, indicating the potential and likely exposure.

3.2 Methods

Flood loss data often exhibit a heavy tail, with the largest observed values typically dominating the others substantially. Heavy-tailed distributions often provide reasonable fits in such contexts. Popular heavy-tailed distributions are the log-normal distribution with cumulative distribution function (cdf)

$$\begin{aligned} F_Y(y) = \Phi \Big (\frac{\log y-\mu }{\sigma } \Big ), \quad y>0 \end{aligned}$$
(1)

(where \(\Phi\) is the cdf of a standard normal distribution), the Weibull distribution with cdf

$$\begin{aligned} F_Y(y) = 1-\exp [(-y/\tau )^\alpha ], \quad y>0, \end{aligned}$$
(2)

with parameters \(\tau >0\) and \(\alpha >0\) (where the heavy-tailed case \(0<\alpha <1\) is of prime interest in the present context), and the classical Pareto distribution with two parameters \(\alpha ,\theta >0\), which assigns a power decay for the tail

$$\begin{aligned} F_Y(y) = 1-(\theta /y)^\alpha ,\quad y\ge \theta , \end{aligned}$$
(3)

and often provides a very reasonable fit for large losses. Whereas these two distributions result from a simple transformation of a normal and exponential, respectively, random variable (which may be considered as a sufficient justification of its choice, looking for compromise between simplicity, clarity and flexibility of models), the Pareto distribution in addition appears naturally as a limit distribution for maxima of samples (see, for example, Beirlant et al. 2004). As one is mainly concerned with the tail of the fitted distribution, this link also gives guidelines on how to fit such a distribution only to the larger values among all observations. Despite the fact that the shape of much of the distribution tail may be well described by such a model, there may be many situations in practice, for which there is a natural upper bound for loss random variables (such as the overall property value in flood prone areas). In a recent paper, Beirlant et al. (2015) adapted classical EVT techniques to fitting procedures for such a truncated Pareto model, where (3) is adapted to

$$\begin{aligned} F_Y(y) = \frac{1-(\theta / y)^\alpha }{1-(\theta / T)^\alpha }, \quad \theta \le y \le T. \end{aligned}$$
(4)

The upper bound T will typically not be known, but can be estimated from the data. One can then also compare the goodness of fit of various models (with and without truncation) and choose the one that is most plausible for the given data situation.

In this paper, we apply such an analysis to annual flood loss data for residential buildings, aggregated per country for 27 European Union countries.Footnote 2 Out of these 27 countries, 7 countries have less than 5 data points, which we consider to be not sufficient to see enough statistical structure in the data to include them in the analysis. In addition, we also exclude the Netherlands since due to extensive protection levels by dams and dikes, the nature of losses is quite different from those of other countries.Footnote 3 Hence, we restrict ourselves to 19 countries. The last year of reported annual losses for all countries is 2013, but the first year differs from country to country. For each of the 19 countriesFootnote 4 considered within the analysis, Table 1 depicts the first year of reported loss, the total number of years comprised by the record and the number of years with a loss occurring. The maximum amount of losses for each country is included in Table 2. For each country and year, we normalize the loss data by the overall residential building value. Both damage data and building values are inflation adjusted.

Table 1 First year, number of years with data and number of years with losses, per country

For the fitting of the Pareto and truncated Pareto distribution, we apply the method of Beirlant et al. (2015), which is an adaptation of the classical Hill estimator. This method works remarkably well on simulated data with truncation and in the limit of the truncation parameter T going to \(\infty\), the procedure retains the classical EVT Hill estimator for \(\alpha\). Note that if the largest k data points are used for the estimation, the parameter \(\theta\) is naturally chosen to be the smallest of these used values. There are various suggestions and algorithms available in the literature towards a suitable choice of k. Here we adopt a covariance criterion (as suggested in Beirlant et al. (2015)).

The parameters of the log-normal and Weibull distributions are estimated by using classical maximum likelihood estimation. We use Q–Q plots to compare the quality of fits among each other and on the basis of those (together with empirical mean-excess plots and related methods) decide for the most suitable among the different models for each country. Clearly, there is a certain degree of subjectivity in the corresponding choice, but in view of the small number of available data points, one cannot use classical goodness-of-fit tests (like a \(\chi ^2\)-test), which is a well-known artefact in EVT. The procedure adopted here seems to be a reasonable compromise between scientific rigor, intuition and experience when working with this kind of data. For illustration, Fig. 1 depicts the log–log plot

$$\begin{aligned} (\log X_{n-j+1,n},\log (j/n)), \quad j=1,\ldots ,n, \end{aligned}$$
(5)

for those countries which pass the test for a truncated Pareto distribution given in Beirlant et al. (2015). One clearly sees the deviation from the (linear) non-truncated Pareto pattern at the right-hand end of the plots.

Fig. 1
figure 1

Log–log plots for annual flood loss data by country. The dashed blue line refers to a Pareto fit, whereas the solid black line refers to a truncated Pareto fit

3.3 Fitted models and risk measures

Figure 2 depicts the Q–Q plots as a measure of goodness of fit for each candidate distribution and each country (in those cases where the truncated Pareto assumption is rejected, we do not give the respective Q–Q plot, as then no estimate for the upper bound is available). From these plots, we choose the most suitable model for each country. For Croatia, the fit is not very satisfactory (possibly due to insufficient data), we still include it in the analysis, but give a word of caution to the respective numbers. Given that insurance solutions should account also for a massive failure of dams, a high protection level by dams should be treated separately. Here the given numbers do certainly not reflect the risk under a major ‘failure of dams’ scenario. For Europe, we aggregated the annual losses over all countries and then applied the same fitting procedure as for the individual countries.

For determining the VaR (i.e. the quantile) at the 99.5 % level, which is a quantity relevant for solvency purposes, there is a simple procedure in the present case: if only data points larger than \(\theta\) are finally used for the estimation of the tail, and a conditional distribution \(F_Y\) is chosen to be the best model above that level \(\theta\), that is

$$\begin{aligned} \mathbb {P} \{ Y \ge y | Y \ge \theta \} = 1-F_Y(y), \end{aligned}$$
(6)

it follows that

$$\begin{aligned} \mathbb {P}(Y \ge y) = \zeta _{\theta }(1-F_Y(y))\quad:= \quad 1-\beta , \end{aligned}$$

where \(\zeta _{\theta }=\mathbb {P}(Y > \theta )\) can be estimated by k / n (here k is the number of used observations and n is the size of the entire sample) and \(\beta\) is the level at which the VaR is calculated. For a chosen value of \(\beta\) (0.995 in the present case), the VaR is then the implicit solution of the above equation for y.

In recent years, several alternatives to VaR as a risk measure have been proposed. The most prominent alternative is expected shortfall at confidence level \(\beta\)

$$\begin{aligned} \text {ES}_\beta (Y) = \mathbb {E}(Y| Y\ge \text {VaR}_\beta (Y)) \end{aligned}$$
(7)

This risk measure is, for example, implemented in the Swiss Solvency Test for \(\beta =0.99\). It has some desirable properties, and even if most European countries due to regulatory rules adhere to the VaR, we give here also the respective ES numbers for comparison.

Table 2 depicts for each country the chosen distribution, the largest observed insurance loss (both in absolute terms and in % of the building value) and the fitted upper bound T (in case the truncated Pareto distribution is the best model). The fifth column gives the building values of 2013, whereas the last four columns present the resulting VaR (99.5 %) and ES (99 %) in terms of percentage of 2013 building values, and in absolute value, in addition, the value at risk and expected shortfall for all mentioned distributions are given in ‘Appendix’. One sees that for some countries the ES is considerably higher than the respective VaR, particularly for those countries where the log-normal distribution is the best fit. Figure 3 depicts the relative value of the resulting VaR for each country, indicating which countries are most prone to flood risk, given the data analysed.

Fig. 2
figure 2

QQ-plots for annual flood loss data (in logarithmic scale) by country. The black circles, blue squares, red triangles and green pluses, respectively, refer to a truncated Pareto, Pareto, log-normal and Weibull fit

Table 2 Chosen model, largest observed insurance loss, fitted upper bound T, total building value 2013, 99.5 % VaR and 99 % ES for 19 EU member states and their aggregate
Fig. 3
figure 3

Relative value of the 99.5 %-VaR, per country

4 Joint risk pooling: a thought experiment

The sum of the VaRs of the individual countries in Table 2 represents the overall capital amount that would be needed at the 99.5 % safety level in case each country dealt with flood risks stand-alone. With 141,052 Mio € , it is almost four times as large as the VaR calculated on the basis of the aggregated loss data of all 19 member states (referred to as ‘Europe’ in Table 2), which incorporates the dependence structure between the individual countries. In other words, pooling the flood risk across these 19 EU member states reduces the capital requirementsFootnote 5 to 38,446 Mio €. Hence, there is a strong diversification potential for pooling flood risk across countries. Based on this observation, the implementation of a flood damage pool or a joint reinsurance either at EU level, or—if no agreement at EU level could be achieved—between subsets of EU countries, seems a reform option worthwhile to consider. Given the available data set and based on our model assumptions, we can hence identify the most effective Joint Risk Pooling Initiatives (JRPIs) in terms of reducing solvency capital requirements.

To that end, we divide the 19 EU member states into two groups. The first group has a rather satisfactory data base available (correspondingly the goodness of fit of the calibrated models is quite satisfactory) and constitutes the countries with larger losses (in fact about 80 % of the overall reported losses in the data set, and—coincidentally—also about 80 % of the total building values are located in these countries). This group includes the seven countries Austria, France, Germany, Italy, Portugal, Spain and UK. The second group consists of the remaining 12 countries, for which there are fewer data points available, but also flood risk is less prominent.

For the pooling of countries of Group 1, denote by \(X_1,X_2,\ldots ,X_7\) the random variables representing losses of these seven countries. The goal is now to find the number of clusters G (and the number \(N_g\) of member countries in cluster g) which minimizes the sum of VaRs of the G clusters:

$$\begin{aligned} \text {VaR}_G = \text {min}\sum _{i=1}^G \text {VaR}\Big ( \sum _{j\in N_g}X_j\Big ) \end{aligned}$$
(8)

For each cluster, we aggregate the loss data of the \(N_g\) countries, fit a truncated Pareto distribution to these and compute the respective VaR (with the exception of France, the truncated Pareto was the best model for these countries anyway). Doing this analysis for all possible combinations of two and three clusters, we can identify the optimal clusters. Tables 3 and 4 show the countries and VaR figures of the resulting clusters. In the last two columns, we also give the corresponding numbers when ES is used as the risk measure.

Table 3 VaR and ES values of two clusters
Table 4 VaR and ES values of three clusters

For pooling of the remaining 12 countries (Group 2), the available data situation does not allow to proceed in the same way as for Group 1, because there are not enough joint data points available for losses in the same years. One can instead use a hierarchical clustering algorithm (see, for example, Kaufman and Rousseeuw (2009)). In general, clustering by such an algorithm is based on a certain measure of distance between the clustering objects, with the goal to minimize the distance within and maximize the distance between clusters. In the present context, this distance can be the pairwise correlation, so that each cluster consists of the countries that are the least correlated with each other, in view of diversification benefits. To determine the pairwise correlation between losses of the countries, one can, for example, use the classical Pearson correlation

$$\begin{aligned} r_P = \frac{(x_i-\bar{x})(y_i-\bar{y})}{\sqrt{\sum _{i=1}^n(x_i-\bar{x})^2}\sqrt{\sum _{i=1}^n(y_i-\bar{y})^2}}, \end{aligned}$$

or the Spearman rank correlation

$$\begin{aligned} r_S = \frac{(rg(x_i)-\overline{rg}_x)(rg(y_i)-\overline{rg}_y)}{\sqrt{\sum _{i=1}^n(rg(x_i)-\overline{rg}_x)^2}\sqrt{\sum _{i=1}^n(y_i-\overline{rg}_y)^2}}, \end{aligned}$$

where \((x_1,y_1),\ldots ,(x_n,y_n)\) are the joint observations of n flood losses, and \(rg(x_i)\) is the rank of observation \(x_i\) in the univariate sample. Recall that the Spearman rank correlation is the Pearson correlation coefficient of the grades of the distribution, and in that way measures for monotonic relationships between two random variables, whereas the Pearson correlation coefficient itself measures linear correlation only. Linear correlation is a very natural concept in the world of normal distributions, but in the present context, the marginal random variables are strongly skewed and heavy-tailed so that the Spearman correlation may be seen as a more natural choice (see, for example, Embrechts et al. (2002)). Correspondingly we use it here. Table 5 shows the pairwise Spearman correlation of countries in Group 2. When deciding for three clusters according to the clustering algorithm described above, they turn out to be

  1. 1.

    Croatia and Czech Republic

  2. 2.

    Bulgaria, Greece, Ireland and Romania

  3. 3.

    Belgium, Cyprus, Hungary, Poland, Slovakia and Sweden.

If one only allows for two clusters of countries, the first cluster is as above and the second cluster is the aggregation of the second and third cluster above.

Table 5 Pairwise Spearman correlation for relative losses of countries in Group 2

Having identified reasonable clusters on the basis of bivariate correlations, it would now be the next step to calculate VaR figures for the resulting clusters. However, the scarce data situation for the countries of Group 2 does not allow for reasonable estimates on quantiles of each cluster. For fitting a distribution to the aggregated losses of a cluster, only data from those years covered by the loss record of every country in the cluster can be used. Unfortunately, this reduces the number of available summed claims too much to provide meaningful fits for their distribution (and then estimate a 99.5 % VaR), and we hence restrict ourselves at this point to simply outlining the clusters as given above. In addition, we remind the even more severe data limitation for Croatia.

Remark Tables 23 and 4 indicate that under suitable collaboration between countries, diversification can significantly reduce the total required solvency capital for flood insurance. Such numbers bring up the question on how such a reduced aggregate capital requirement should then be subdivided (‘allocated’) to the individual countries. In the literature, some suggestions for that have been developed, among them the Euler allocation principle, see, for example, Tasche (2007). In order to implement such an allocation procedure, one needs to fully specify a dependence model for the multivariate loss distribution across countries. However, due to the limited amount of available data, we do not pursue the formulation of such a concrete dependence model in the present paper, as our focus is on the aggregate view. Clearly, with a more refined data set available, it will be interesting to elaborate further on such aspects in the future.

5 Conclusion

In this paper, we discussed some challenges in flood risk assessment and management for Europe. High spatial correlation of flood damages is one of the obstacles for a well-functioning flood insurance scheme on the national level. However, understanding the dependence structure across national flood damage data, (non-)correlation can become part of the solution to provide flood insurance for Europe: Based on a data set comprising annual flood losses of most EU member states, we calibrated flood models using a methodology for truncated distributions developed recently in extreme value theory. The resulting models provided a tool to quantify flood risk diversification potentials. Based on the results, our suggestion is to exploit Europe’s magnitude and diversity related to flood risk by jointly buying reinsurance or forming a risk pool. In case a collaboration on the entire EU level is not feasible, the voluntary establishment of Joint Risk Pooling Initiatives (JRPIs) between subsets of EU countries seems an option worthwhile to consider. In the paper at hand, we not only created the methodological framework to actually quantify concrete diversification benefits of such JRPIs, but also did some explicit calculations where feasible for the data. The results presented in the paper are clearly subject to and limited to the quality of the available data, and it will be an interesting challenge to deepen the analysis in the future with improved and enlarged data sets. For example, to our knowledge smaller countries that have a strong national reinsurance tradition may be under-represented in the utilized data set (stemming from an international reinsurer); thus, some more data gathering activities might be a valuable undertaking from an EU-wide perspective.

For our analysis, the question who actually takes the responsibility to insure a nation’s flood risk is not relevant. How deeply the state government should be involved in risk transfer also may be a matter of taste and economic policy doctrines. What our analysis clearly shows, however, is that European flood risk is diversified in a way such that there are strong economic incentives to pool the national risk portfolios. What also becomes clear from the analysis is that once some role of public governance for a functioning risk transfer mechanism for flood risk is accepted, the question of sound (damage) data provision and publicly available risk models based on such sound data also becomes a key requisite for providing cheap and sound insurance solutions.