Quality and performance of a PM10 daily forecasting model☆
Introduction
Particularly during the winter season, the basin areas of the Alps (including the cities of Graz, Klagenfurt and Bolzano) are exposed to weather conditions such as stationary temperature inversions, a low amount of precipitation and low wind velocities. These special weather conditions cause an extensive load of particulate matter (PM) in ambient air. The issue of PM/fine dust has recently caught remarkable attention and is still a very present and explosive topic in science and politics. The PM10 (particles with an aerodynamic diameter ) concentration is measured in units of . According to the EU framework directive 1999/30/EC (European Community, 1999) the limit value for the daily PM10 average is and must not be exceeded on more than 35 days of the year (valid for the years 2005–2009); a reduction to 7 days in 2010 is envisaged. Additionally, the annual PM10 average must not exceed the limit of ( starting with 2010). However, at the test point GT1 in Graz (near the pedestrian zone in the center of the city) we observed in the period 2003–2006 between 90 (2004) and 137 (2003) exceedances of the daily PM10 limit and registered high annual averages between 41 and . The physical and chemical composition of the particles is very complex. There are natural sources like pollen or crushing and grinding rocks and soil (primary particles). Contrarily, there are particles which arise from aerially pollutants (secondary particles). Anthropogenic particles are produced by traffic, domestic fuel and industry. They may be directly exhausted by burning processes or arise from mechanical abrasion of tyres, brakes, tarmac, etc. The coarse particles (PM10 –2.5) are composed of smoke, dirt and dust, the fine particles (PM2.5) are rather toxic organic compounds or heavy metals. Estimates about the proportion of the PM polluters differ widely and have to be related to the specific environment (urban, rural, seaside, etc.). For further information on aerosol source analysis in urban and rural areas of Austria we refer to the Aquella project http://www.iac.tuwien.ac.at/environ/aquella.html, which is still in progress.
Negative health effects caused by PM have been analyzed in many epidemiological and toxicological studies. An extensive general review can be found in Pope III and Dockery (2006). A description of the Austrian study AUPHEP is given in Hauck et al. (2004), while Schwarze et al. (2006) is an excellent review including 210 references of studies carried out in different regions throughout the world. In general, fine particles (PM2.5) are more likely to be toxic since they often consist of heavy metals and carcinogenic organic compounds. Furthermore, they are inhaled into the trachea and the respiratory system in general. In Klagenfurt and Bolzano where measurements for PM2.5 are available, it can be observed that the ratio PM2.5/PM10 is relatively constant. Hence investigations on PM10 in this region allow to draw some conclusions on finer PM fractions as well. In Bolzano about 40% of PM10 belong to the fine fraction, while the percentage in Klagenfurt is approximately twice as high. In contrast to the rare PM1 and PM2.5 data sources, extensive data on PM10 were available in all three cities. Furthermore, since the EU limits refer to PM10 we based our research on this specific measure.
Due to the negative health effects caused by PM10, policy had to react (and take drastic measures) against the PM problem. In the winter season 2006/2007, the cities of Bolzano, Graz and Klagenfurt as well as the respective Provinces were forced to take further action against high PM10 concentrations of the ambient air. In general, traffic regulations become effective if the limit values are exceeded for several days and if a reduction on the day in question is “unlikely”. For the appropriate authorities it is necessary to base singular decisions on reliable forecasting models for daily PM10 concentrations. The objective of the present paper is to deliver PM10 forecasting models for the specific sites and to illustrate their practical applicability as a decision making tool for possible traffic restrictions. As the traffic regulations mentioned above regard the period between October and March, we based our models on that specific time of year. Better dissemination conditions for the ambient air during spring and summer lead to considerably lower PM10 concentrations and thus, there is no urgent need for action in that warm season of the year. The investigations were made within the framework of the EU-Life-project KAPA GS (Klagenfurts Anti PM10 Action Programme in cooperation with Graz and South-Tyrol: project duration from 1 July 2004 to 30 September 2007).
Our models are based on multiple linear regression which we found to be the most convenient method. Other typical approaches for PM prediction are neuronal networks (cf. Pérez and Reyes, 2002, Hooyberghs et al., 2005), discriminant analysis (cf. Silva et al., 2001) or Kalman filtering (cf. van der Wal and Jansen, 1999). The demand for our model was simplicity, practical feasibility and sufficient accuracy. Simplicity is guaranteed by the linear structure of the model. To obtain practical feasibility, it is vital to perform a careful choice of parameters. Meteorological parameters, for example, have to be forecasted individually (type-B parameters) in operational mode and thus it has to be assured that this additional uncertainty will not prevail. Hence, the precision of the prediction for a specific day will to some extent depend on the quality of the singular weather forecasts. Our empirical studies showed that temperature inversion, precipitation and wind velocity play an important role at all three sites. After consulting the ZAMG (Styrian meteorologic office) we decided to include variables which are representative for these impact factors and can also be forecasted with sufficient precision. In order to guarantee sufficient accuracy, we included all relevant parameters available or measured at the time when the forecasts were generated (type-A parameters), e.g. the PM10 24 h moving average.
In contrast to many theoretical studies, we are able to present the performance of the model in operational mode. During a three-year trial period in Graz we made PM10 forecasts available at the web site of our project (http://www.feinstaubfrei.at). For the generation of the predictions we used meteorological forecasts from ZAMG Styria where the meteorologists provided us with analyzed simulation data by the systems ECMWF and Aladin. We observed that PM10 forecasts for several days do not loose much quality if the type-B parameters were known. However, we found that forecasts of two or more days in advance will become unreliable in practise.
By virtue of the high dispersion of PM10 data and our specific requirements we found that commonly used measures do not reveal the quality of the forecasts with respect to our needs. A forecast of for an observation of , for example, yields a 100% relative error, though both the observation and the prediction are clearly below the limit value. On the other hand we observe peaks with values . Typically, the model under-estimates jerky leaps and in this case a prediction of (say) is not bad, even though the absolute error is high; the forecast value still indicates alert status. Concerning decision making for traffic restrictions, we have to concentrate on avoiding errors leading to unjustified measures. In order to incorporate these specific requirements we used—besides standard measures like correlation or mean squared error—a quality function assigning a meaningful rating to each pair of observation and forecast value.
In the next section we describe the sites, databases and input parameters. Section 3 contains the methodology of our study and quality issues are discussed in Section 4. Results of test runs are analyzed in Section 5 and in the final section we summarize our findings and express our conclusions.
Section snippets
Sites and database
Our investigations took place of sites within the cities of Bolzano, Klagenfurt and Graz. The three cities are located in basin areas south of the main Alpine crest and show similar climatical characteristics. Rain clouds from the Atlantic are kept away by the Alps implying low precipitation and low wind velocities during the cold period. Furthermore, stationary temperature inversions frequently occur at that time of year (due to the basin location). Some key data, listed in Table 1, Table 2,
Multiple linear regression
Our goal is to model the daily PM10 averages by a linear model which will be the basis for our forecasting models. The regression models are conceived for the dependent variable . A square root transformation is necessary to assure a constant error variance and to avoid a violation of the model assumptions. It should be noted that the standard transformation used in this context is rather the log-transformation (see e.g. Lonati et al., 2006). However, in our case the model diagnostics showed
The forecasting models
The multiple linear regression models obtained by means of the procedure described in Section 3.1, are now used as the basis for the forecasting models. As a first step we used historical data to compute the estimates for the linear model. This can be done with any standard statistics package (we employed SPSS 14.0) and yields an estimate for . The simplest way to get an estimate for PM10 is to take the square of the predicted . The resulting bias is negligible for our purposes (and
Quality function
To measure the quality of our forecasts we need a reasonable rating system different from usual measures as e.g. the absolute error which is large if the forecast is and the observation is . However, this forecast value will indicate a right decision when partial traffic regulation measures are taken whenever (e.g. ). Contrarily, if the prediction is and the observation is a traffic regulation may not be justified and the relatively
Theoretical performance
The investigations below are based on all available data for the cold periods at the sites BT1, KT1 and GT1. To check the theoretical performance of our models, we computed the coefficient estimates by excluding the last cold period available, which shall be our test period. Then we calculated the daily PM10 predictions of the test period with these estimated coefficients. Instead of the forecasted () we used the measured values , referred to as “exact” meteorological
Meteorological forecasts
In Graz, the daily meteorological forecasts delivered by the ZAMG Styria started on 15 December 2004 and includes the variables wind, and prec. The first winter period 2004/2005 was used as pilot study to test the performance of the model with actual weather forecasts. The quantitative forecast of the wind velocities was a particularly difficult task for the meteorologists. At the beginning of the test run the scale was systematically too high. The reason for this was that the test point
Quality in operational mode
During our three test periods in Graz (15 December 2004 to 31 March 2005; 17 October 2005 to 31 March 2006; 2 November 2006 to 31 March 2007) we fed our linear models with the meteorological forecasts of wind, prec and . Before each season the parameters of the models were estimated anew from the updated data. As a matter of fact replacing the observed by the forecasted values causes an additional error and the question is how reliable the model remains. Fig. 4 compares the observed and
Summary and conclusion
We show that PM10 forecasting models based on linear regression for Bolzano, Klagenfurt and Graz provide suitable results. Due to the simple and transparent character of the model we find that more complicated black box approaches are not necessary in this case. The input variables are selected in order to represent both meteorological and anthropogenic parameters. In general, we lay special emphasis on the practical performance and the treatability of the model. For the operational mode it is
References (11)
- et al.
AUPHEP—Austrian project on health effects of particulates—general overview
Atmospheric Environment
(2004) - et al.
A neuronal network forecast for daily average PM10 concentrations in Belgium
Atmospheric Environment
(2005) - et al.
The role of traffic emissions from weekends’ and weekdays’ fine PM data in Milan
Atmospheric Environment
(2006) - et al.
Prediction of maximum of 24-h average of PM10 concentrations 30 h in advance in Santiago, Chile
Atmospheric Environment
(2002) Council Directive 1999/39/EC of 22 April 1999 regulating to limit values for sulphur dioxide, nitrogen dioxide and oxides of nitrogen, particulate matter and lead in ambient air
Official Journal of the European Communities
(1999)
Cited by (111)
Spatiotemporal informer: A new approach based on spatiotemporal embedding and attention for air quality forecasting
2023, Environmental PollutionAir quality prediction using spatio-temporal deep learning
2022, Atmospheric Pollution ResearchForecasting: theory and practice
2022, International Journal of ForecastingA short-distance healthy route planning approach
2022, Journal of Transport and HealthAir pollution forecasting with multivariate interval decomposition ensemble approach
2021, Atmospheric Pollution Research
- ☆
Research supported by the EU-Life-project KAPA GS LIFE04 ENV/AT/000006.