Abstract
Many impact studies require climate change information at a finer resolution than that provided by global climate models (GCMs). This paper investigates the performances of existing state-of-the-art rule induction and tree algorithms, namely single conjunctive rule learner, decision table, M5 model tree, and REPTree, and explores the impact of climate change on maximum and minimum temperatures (i.e., predictands) of 14 meteorological stations in the Upper Thames River Basin, Ontario, Canada. The data used for evaluation were large-scale predictor variables, extracted from National Centers for Environmental Prediction/National Center for Atmospheric Research reanalysis dataset and the simulations from third generation Canadian coupled global climate model. Data for four grid points covering the study region were used for developing the downscaling model. M5 model tree algorithm was found to yield better performance among all other learning techniques explored in the present study. Hence, this technique was applied to project predictands generated from GCM using three scenarios (A1B, A2, and B1) for the periods (2046–2065 and 2081–2100). A simple multiplicative shift was used for correcting predictand values. The potential of the downscaling models in simulating predictands was evaluated, and downscaling results reveal that the proposed downscaling model can reproduce local daily predictands from large-scale weather variables. Trend of projected maximum and minimum temperatures was studied for historical as well as downscaled values using GCM and scenario uncertainty. There is likely an increasing trend for T max and T min for A1B, A2, and B1 scenarios while decreasing trend has been observed for B1 scenarios during 2081–2100.






Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Amor VMI, James WH (2006) Bias correction of daily GCM rainfall for crop simulation studies. Agric For Meteorol 138:44–53
Anandhi A, Srinivas VV, Kumar DN, Nanjundiah RS (2009) Role of predictors in downscaling surface temperature to river basin in India for IPCC SRES scenarios using support vector machine. Int J Climatol 29:583–603
Arora M, Goel NK, Singh P (2005) Evaluation of temperature trends over India. Hydrol Sci J 50(1):1–93
Bardossy A, Bogardi I, Matyasovszky I (2005) Fuzzy rule-based downscaling of precipitation. Theor Appl Climatol 82:119–129
Bhattacharya B, Solomatine DP (2005) Neural networks and M5 model trees in modelling water level–discharge relationship. Neurocomputing 63:381–396
Bhattacharya B, Price RK, Solomatine DP (2007) Machine learning approach to modeling sediment transport. J Hydraul Eng 133(4):440–450
Breiman L (1984) Classification and regression trees. Chapman & Hall/CRC, Boca Raton, p 368
Burn HB, Hag Elnur MA (2002) Detection of hydrologic trends and variability. J Hydrol 255:107–122
Cannon AJ (2011) Quantile regression neural networks: implementation in R and application to precipitation downscaling. Comput Geosci 37:1277–1284
Cannon AJ, Lord ER (2000) Forecasting summertime surface-level ozone concentrations in the Lower Fraser Valley of British Columbia. An ensemble neural network approach. J Air Waste Manage Assoc 50:322–339
Cannon AJ, Whitfield PH (2002) Downscaling recent streamflow conditions in British Columbia, Canada using ensemble neural network models. J Hydrol 259:136–151
Chu JT, Xia J, Xu CY, Singh VP (2010) Statistical downscaling of daily mean temperature, pan evaporation and precipitation for climate change scenarios in Haihe River, China. Theor Appl Climatol 99:149–161
Clark P, Niblett T (1989) The CN2 rule induction algorithm. Mach Learn 3:261–284
Cohen W (1995) Fast effective rule induction. In: Proceedings 12th International Conference on Machine Learning. Morgan Kaufmann, San Francisco, pp 115–123
Conway D, Wilby RL, Jones PD (1996) Precipitation and air flow indices over the British Isles. Clim Res 7:169–183
Coulibaly P (2004) Downscaling daily extreme temperatures with genetic programming. Geophys Res Lett 31:L16203. doi:10.1029/2004GL020075
Coulibaly P, Dibike YB, Anctil F (2005) Downscaling precipitation and temperature with temporal neural networks. J Hydrometeorol 6(4):483–496
Daud MNR, Corne DW (2007) Human readable rule induction in medical data mining: a survey of existing algorithms. In: WSEAS European Computing Conference, 2007, Athens, Greece
Douglas EM, Vogel RM, Kroll CN (2000) Trends in floods and low flows in the United States: impact of spatial correlation. J Hydrol 240:90–105
Dubrovsky M, Buchtele J, Zalud Z (2004) High-frequency and low frequency variability in stochastic daily weather generator and its effect on agricultural and hydrologic modelling. Clim Chang 63:145–179
Shahidi AE, Mahjoobi J (2009) Comparison between M5 model tree and neural networks for prediction of significant wave height in Lake Superior. Ocean Eng 36:1175–1181
Fowler HJ, Wilby RL (2007) Beyond the downscaling comparison study. Int J Climatol 27:1543–1545
Fowler HJ, Kilsby CG, O’Connell PE (2000) A stochastic rainfall model for the assessment of regional water resource systems under changed climatic conditions. Hydrol Earth Syst Sci 4:261–280
Fowler HJ, Blenkinsop S, Tebaldi C (2007) Linking climate change modelling to impacts studies: recent advances in downscaling techniques for hydrological modelling. Int J Climatol 27:1547–1578
Gachon P, Dibike Y (2007) Temperature change signals in northern Canada: convergence of statistical downscaling results using two driving GCMs. Int J Climatol 27:1623–1641
Gangopadhyay SM, Clark B, Rajagopalan (2005) Statistical downscaling using K-nearest neighbors. Water Resour Res 41:W02024. doi:10.1029/2004WR003444
Gardner MW, Dorling SR (1998) Artificial neural networks (the multi layer perceptron)—a review of applications in the atmospheric sciences. Atmos Environ 32:2627–2636
Giorgi F, Hewitson BC (2001) Regional climate information—evaluation and projections. In: Houghton JT, Ding Y, Griggs DJ, Noguer M, van der Linden PJ, Dia X, Maskell K, Johnson CA (eds) Climate change 2001: the scientific basis. Cambridge University Press, Cambridge
Goodess CM, Palutikof J (1998) Development of daily rainfall scenarios for southeast Spain using a circulation-type approach to downscaling. Int J Climatol 18:1051–1083
Grotch SL, MacCracken MC (1991) The use of general circulation models to predict regional climatic change. J Clim 4:286–303
Goyal MK and Ojha CSP (2010a) Robust Weighted Regression As A Downscaling Tool In Temperature Projections International Journal of Global Warming, Interscience Publishers, UK, 2(3):234–251
Goyal MK and Ojha CSP (2010b) Evaluation of Various Linear Regression Methods for Downscaling of Mean Monthly Precipitation in Arid Pichola Watershed Natural Resources, Scientific Research, USA, 1(1):11–18 doi:10.4236/nr.2010.11002
Goyal MK and Ojha CSP (2010c) Application of PLS-Regression as downscaling tool for Pichola lake basin in India International Journal of Geosciences, Scientific Research, USA 1:51–57, doi:10.4236/ijg.2010.12007
Goyal MK, Ojha CSP and Burn DH (2011) Nonparametric Statistical Downscaling of Temperature, Precipitation and Evaporation for Semi-Arid Region in India, ASCE-Journal of Hydrologic Engg. doi:10.1061/(ASCE)HE.1943-5584.0000479
Goyal MK and Ojha CSP (2011a) PLS regression based Pan evaporation and Minimum-Maximum Temperature projections for an arid lake basin in India Theoretical and Applied Climatology, Springer Netherlands, 105(3):403–415. doi:10.1007/s00704-011-0406-z
Goyal MK and Ojha CSP (2011b) Downscaling of Surface Temperature for Lake Catchment in Arid Region in India using Linear Multiple Regression and Neural Networks International Journal of Climatology, John Wiley & Sons. doi:10.1002/joc.2286
Goyal MK and Ojha CSP (2011c) Evaluation of Linear Regression Methods As Downscaling Tool in Temperature Projections Over Pichola lake Basin in India Hydrological Processes, John Wiley & Sons, 25(9):1453–1465. doi:10.1002/hyp.7911
Goyal MK and Ojha CSP (2011d) Estimation of Scour Downstream of a Ski—Jump Bucket Using Support Vector and M5 Model Tree, Water Resources Management, Springer Netherlands, 25(9):2177–2195. doi:10.1007/s11269-011-9801-6
Goyal MK and Ojha CSP (2011e) Downscaling of Precipitation on a Lake Basin: Evaluation of Rule and Decision Tree Induction Algorithms, Hydrology Research vol. 43 (In Press).
Hessami M, Gachon P, Ouarda Taha BMJ, St-Hilaire A (2008) Automated regression-based statistical downscaling tool. Environ Model Softw 23(6):813–834
Hewitson BC, Crane RG (eds) (1994) Neural nets applications in geography. Kluwer Academic, Dordrecht
Heyen H, Zorita E, von Storch H (1996) Statistical downscaling of monthly mean North Atlantic air-pressure to sea level anomalies in the Baltic Sea. Tellus 48A:312–323
Huth R (2004) Sensitivity of local daily temperature change estimates to the selection of downscaling models and predictors. J Clim 17:640–652
Ibrahim F, Abu Osman NA, Usman J, Kadri NA (eds) (2006) Biomed 06, IFMBE Proceedings 15: 520–523, 2007
Intergovernmental Panel on Climate Change (IPCC) (2007) Climate change 2007—the physical science basis. Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change. In: Solomon S, Qin D, Manning M, Marquis M, Averyt K, Tignor MMB, Miller HLR Jr, Chen Z (eds). Cambridge University Press, Cambridge. IPCC Data Distribution Centre (DDC). http://www.mad.zmaw.de/IPCC_DDC/html/ddc_gcmdata.html
IPCC (2001) In: Houghton JT et al (eds) The scientific basis. Contribution of WGI to the Third Assessment Report of the Intergovernmental panel on Climate Change. Cambridge University Press, Cambridge, p 881
Kalnay E, Kanamitsu M, Kistler R, Collins W, Deaven D, Gandin L, Iredell M, Saha S, White G, Woollen J, Zhu Y, Chelliah M, Ebisuzaki W, Higgins W, Janowiak J, Mo KC, Ropelewski C, Wang J, Leetmaa A, Reynolds R, Jenne R, Joseph D (1996) The NCEP/NCAR 40-year reanalysis project. Bull Am Meteorol Soc 77(3):437–471
Kang HW, An KH, Park CK, Solis ALS, Stitthichivapak K (2007) Multimodel output statistical downscaling prediction of precipitation in the Philippines and Thailand. Geophys Res Lett 34:L15710. doi:10.1029/2007GL030730
Kendall MG (1975) Rank correlation methods. Charles Griffin, London, p 202
Kilsby CG, Cowpertwait PSP, O’Connell PE, Jones PD (1998) Predicting rainfall statistics in England and Wales using atmospheric circulation variables. Int J Climatol 18:523–539
King L, Solaiman T, Simonovic SP (2009) Assessment of climatic vulnerability in the Upper Thames River Basin. Water Resources Research Report no. 064. Facility for Intelligent Decision Support, Department of Civil and Environmental Engineering, London, p 62
Kite GW (1997) Simulating Columbia river flows with data from regional-scale climate models. Water Resour Res 33(6):1275–1285
Kohavi R (1995) The power of decision tables. In: Machine learning, pp 174–189
Mann HB (1945) Nonparametric tests against trend. Econometrica 13:245–259
Mason SJ (2004) Simulating climate over Western North America using stochastic weather generators. Clim Chang 62:155–187
Nilsson JN (1999) Introduction to machine learning. California, United Stated of Americas
Ojha CSP., Goyal MK and Adeloye AJ (2010) Downscaling of Precipitation for Lake Catchment in Arid Region in India using Linear Multiple Regression and Neural Networks The Open Journal of Hydrology, Bentham Science Publishers, 2010, 4, 122–136.
Othman MFb, Moh TSY (2007) Comparison of different classification techniques using WEKA for breast cancer. Biomed 06: IFMBE proceedings 15, pp 520–523
Prodanovic P, Simonovic SP (2007) Integrated water resources modelling of the Upper Thames River Basin. In: 18th Canadian Hydrotechnical Conference—Challenges for Water Resources Engineering in a Changing World, Winnipeg, Manitoba, August 22–24
Quinlan JR (1992) Learning with continuous classes. In: Proc. of the Fifth Australian Joint Conference on Artificial Intelligence. World Scientific, Singapore, pp 343–348
Quinlan JR (1996) Improved use of continuous attributes in C4.5. J Artif Intell Res 4:77–90
Salathé EP Jr (2003) Comparison of various precipitation downscaling methods for the simulation of streamflow in a rainshadow river basin. Int J Climatol 23:887–901
Salathé EP Jr, Mote Philip W, Wiley Matthew W (2007) Review of scenario selection and downscaling methods for the assessment of climate change impacts on hydrology in the United States pacific northwest. Int J Climatol 27:1611–1621
Schoof JT, Pryor SC (2001) Downscaling temperature and precipitation: a comparison of regression-based methods and artificial neural networks. Int J Climatol 21:773–790
Senthil kumar AR, Ojha CSP, Goyal MK, Singh RD, Swamee PK (2011), Modelling of Suspended Sediment Concentration at Kasol in India using ANN, Fuzzy Logic and Decision Tree Algorithms, ASCE's Journal of Hydrologic Engineering. doi:10.1061/(ASCE)HE.1943-5584.0000445
Shannon DA, Hewitson BC (1996) Cross-scale relationships regarding local temperature inversions at Cape Town and global climate change implications. S Afr J Sci 92(4):213–216
Sharif M, Burn DH (2006) Simulating climate change scenarios using an improved K-nearest neighbor model. J Hydrol 325:179–196
Sharif M, Burn DH (2007) An improved K-nearest neighbor weather generating model. J Hydrol Eng 12(1):42–51
Solaiman T, Simonovic SP (2010) Assessing NCEP–NCAR hydroclimatic data for macroscale hydrologic modelling in South-Western Ontario, Canada. Can J Civ Eng 37:611–623
Solomatine DP, Xue Y (2004) M5 model trees and neural networks: application to flood forecasting in the upper reach of the Huai River in China. J Hydrol Eng 9(6):491–501
Solomatine DP, Dulal KN (2003) Model tree as an alternative to neural network in rainfall–runoff modelling. Hydrol Sci J 48(3):399–411
Tripathi S, Srinivas VV, Nanjundiah RS (2006) Downscaling of precipitation for climate change scenarios: a support vector machine approach. J Hydrol 330(3–4):621–640
Wang Y, Witten IH (1997) Induction of model trees for predicting continuous lasses. In: Proceedings of the Poster Papers of the European Conference on Machine Learning. University of Economics, Faculty of Informatics and Statistics, Prague
Wang YQ, Leung LR, Mcgregor JL, Wang WC, Ding YH, Kimura F (2004) Regional climate modeling: progress, challenges, and prospects. J Meteorol Soc Jpn 82(6):1599–1628
Wetterhall F, Halldin S, Xu C-Y (2005) Statistical precipitation downscaling in central Sweden with the analogue method. J Hydrol 306:174–190
Wilby RL, Harris I (2006) A framework for assessing uncertainties in climate change impacts: low-flow scenarios. Water Resour Res 42:W02419. doi:10.1029/2005WR004065
Wilby RL, Wigley TML (1997) Downscaling general circulation model output: a review of methods and limitations. Prog Phys Geogr 21:530–548
Wilby RL, Wigley TME (2000) Precipitation predictors for downscaling: observed and general circulation model relationships. Int J Climatol 20:641–661
Wilby RL, Wigley TML, Conway D, Jones PD, Hewitson BC, Main J, Wilks DS (1998) Statistical downscaling of general circulation model output: a comparison of methods. Water Resour Res 34:2995–3008
Wilby RL, Charles SP, Zorita E, Timbal B, Whetton P, Mearns LO (2004) Guidelines for use of climate scenarios developed from statistical downscaling methods, available from the DDC of IPCC TGCIA, p 27. Available from: IPCC-DDC: http://www.ipcc-data.org/
Wilks DS (1989) Conditioning stochastic daily precipitation models on total monthly precipitation. Water Resour Res 25(6):1429–1439
Willmott CJ, Rowe CM, Philpot WD (1985) Small-scale climate map: a sensitivity analysis of some common assumptions associated with the grid-point interpolation and contouring. Am Cartograph 12:5–16
Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco
Xoplaki E, Luterbacher J, Burkard R, Patrikas I, Maheras P (2000) Connection between the large-scale 500 hPa geopotential height fields and precipitation over Greece during wintertime. Clim Res 14:129–146
Xu C-Y (1999) From GCMs to river flow: a review of downscaling methods and hydrologic modelling approaches. Progr Phys Geogr 23(2):229–249
Acknowledgments
This work was made possible through a Canadian Commonwealth Scholarship program, awarded to the first author from the Canadian Bureau for International Education to pursue research at University of Waterloo, Waterloo, ON, Canada.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
T max: M5 pruned model tree (using smoothed linear models)
pc1 ≤ 0.303
| pc1 ≤ −5.489: LM1 (990/29.777%)
| pc1 > −5.489: LM2 (1,233/29.451%)
pc1 > 0.303: LM3 (2,161/25.473%)
LM num 1
MaxTemp = 1.9188 × pc1 − 0.3197 × pc2 − 0.7801 × pc3 − 1.3322 × pc4 + 0.0036 × pc5 − 0.9671 × pc6 + 0.9386 × pc7 − 4.2026 × pc8 + 0.0235 × pc9 + 0.005 × pc10 + 12.4731
LM num 2
MaxTemp = 1.8496 × pc1 − 0.4515 × pc2 − 0.7399 × pc3 − 1.2969 × pc4 + 0.4277 × pc5 − 1.0828 × pc6 + 0.815 × pc7 − 2.4603 × pc8 + 1.2959 × pc9 + 0.005 × pc10 + 12.1503
LM num: 3
MaxTemp = 1.8513 × pc1 − 0.5194 × pc2 − 0.6143 × pc3 − 1.6666 × pc4 − 0.4695 × pc5 − 0.5368 × pc6 + 1.4823 × pc7 − 1.4421 × pc8 + 2.4781 × pc9 + 0.6268 × pc10+ 10.9677
Number of rules—3
T min: M5 pruned model tree (using smoothed linear models):
pc1 ≤ 1.317
| pc1 ≤ −5.439
| | pc4 ≤ 0.37: LM1 (498/30.741%)
| | pc4 > 0.37: LM2 (503/36.681%)
| pc1 > −5.439
| | pc1 ≤ −2.22
| | | pc3 ≤ 2.042: LM3 (521/30.204%)
| | | pc3 > 2.042: LM4 (186/40.401%)
| | pc1 > −2.22: LM5 (757/31.316%)
pc1 > 1.317: LM6 (1,919/27.471%)
LM num 1
MaxTemp = 1.9749 × pc1 − 0.1784 × pc2 − 1.1794 × pc3 − 1.3166 × pc4 + 1.3908 × pc5 − 0.0124 × pc6 + 0.0415 × pc7 − 2.9836 × pc8 − 0.017 × pc9 + 0.0171 × pc10 + 6.4099
LM num 2
MaxTemp = 2.815 × pc1 − 0.007 × pc2 − 2.0276 × pc3 − 1.5988 × pc4 + 1.8935 × pc5 − 0.6826 × pc6 + 0.0413 × pc7 − 5.8322 × pc8 − 1.8691 × pc9 + 0.0171 × pc10 + 12.2365
LM num 3
MaxTemp = 1.3874 × pc1 − 0.0965 × pc2 − 0.962 × pc3 − 0.5622 × pc4 + 1.6538 × pc5 − 0.4808 × pc6 + 0.0306 × pc7 − 2.4992 × pc8 − 0.5334 × pc9 + 0.0136 × pc10 + 3.7766
LM num 4
MaxTemp = 1.8142 × pc1 − 0.0157 × pc2 − 1.448 × pc3 − 2.2798 × pc4 + 1.9731 × pc5 − 0.6306 × pc6 + 1.4846 × pc7 − 2.6684 × pc8 − 1.528 × pc9 + 0.0136 × pc10 + 5.4219
LM num 5
MaxTemp = 1.2122 × pc1 − 0.1827 × pc2 − 0.95 × pc3 − 0.7975 × pc4 + 0.8704 × pc5 − 0.2823 × pc6 + 0.816 × pc7 − 1.5892 × pc8 + 0.3301 × pc9 + 0.0136 × pc10 + 3.0578
LM num 6
MaxTemp = 1.3961 × pc1 − 0.1521 × pc2 − 0.7257 × pc3 − 0.8519 × pc4 + 0.3926 × pc5 + 0.7775 × pc7 − 1.0703 × pc8 + 1.2162 × pc9 + 0.39 × pc10 + 2.9588
Number of rules—6
where pc represents principal component.
Rights and permissions
About this article
Cite this article
Goyal, M.K., Burn, D.H. & Ojha, C.S.P. Evaluation of machine learning tools as a statistical downscaling tool: temperatures projections for multi-stations for Thames River Basin, Canada. Theor Appl Climatol 108, 519–534 (2012). https://doi.org/10.1007/s00704-011-0546-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-011-0546-1