Abstract
Groundwater plays a crucial role in sustaining industrial and agricultural production and meeting the water demands of the growing population in the semi-arid Guanzhong Basin of China. The objective of this study was to evaluate the groundwater potential of the region through the use of GIS-based ensemble learning models. Fourteen factors, including landform, slope, slope aspect, curvature, precipitation, evapotranspiration, distance to fault, distance to river, road density, topographic wetness index, soil type, lithology, land cover, and normalized difference vegetation index, were considered. Three ensemble learning models, namely random forest (RF), extreme gradient boosting (XGB), and local cascade ensemble (LCE), were trained and cross-validated using 205 sets of samples. The models were then applied to predict groundwater potential in the region. The XGB model was found to be the best, with an area under the curve (AUC) value of 0.874, followed by the RF model with an AUC of 0.859, and the LCE model with an AUC of 0.810. The XGB and LCE models were more effective than the RF model in discriminating between areas of high and low groundwater potential. This is because most of the RF model’s prediction outcomes were concentrated in moderate groundwater potential areas, indicating that RF is less decisive when it comes to binary classification. In areas predicted to have very high and high groundwater potential, the proportions of samples with abundant groundwater were 33.6%, 69.31%, and 52.45% for RF, XGB, and LCE, respectively. In contrast, in areas predicted to have very low and low groundwater potential, the proportions of samples without groundwater were 57.14%, 66.67%, and 74.29% for RF, XGB, and LCE, respectively. The XGB model required the least amount of computational resources and achieved the highest accuracy, making it the most practical option for predicting groundwater potential. The results can be useful for policymakers and water resource managers in promoting the sustainable use of groundwater in the Guanzhong Basin and other similar regions.
Similar content being viewed by others
Availability of data and materials
The datasets used or analyzed and the Python code during the current study are available from the corresponding author on reasonable request.
References
Ahmad, I., Dar, M. A., Fenta, A., et al. (2021). Spatial configuration of groundwater potential zones using OLS regression method. Journal of African Earth Sciences, 177,104147. https://doi.org/10.1016/j.jafrearsci.2021.104147
Ahmad, I., Hasan, H., Jilani, M. M., & Ahmed, S. I. (2023). Mapping potential groundwater accumulation zones for Karachi city using GIS and AHP techniques. Environmental Monitoring and Assessment, 195, 381. https://doi.org/10.1007/s10661-023-10971-x
Al-Abadi, A. M., Al-Temmeme, A. A., & Al-Ghanimy, M. A. (2016). A GIS-based combining of frequency ratio and index of entropy approaches for mapping groundwater availability zones at Badra–Al Al-Gharbi–Teeb areas, Iraq. Sustain Water Resour Manag, 2, 265–283. https://doi.org/10.1007/s40899-016-0056-5
Anand, B., Karunanidhi, D., & Subramani, T. (2021). Promoting artificial recharge to enhance groundwater potential in the lower Bhavani River basin of South India using geospatial techniques. Environmental Science and Pollution Research, 28, 18437–18456. https://doi.org/10.1007/s11356-020-09019-1
Arabameri, A., Pal, S. C., Rezaie, F., et al. (2021). Modeling groundwater potential using novel GIS-based machine-learning ensemble techniques. Journal of Hydrology: Regional Studies, 36, 100848. https://doi.org/10.1016/j.ejrh.2021.100848
Arabameri, A., Rezaei, K., Cerda, A., et al. (2019). GIS-based groundwater potential mapping in Shahroud plain, Iran. A comparison among statistical (bivariate and multivariate), data mining and MCDM approaches. Science of the Total Environment, 658, 160–177. https://doi.org/10.1016/j.scitotenv.2018.12.115
Arefin, R. (2020). Groundwater potential zone identification using an analytic hierarchy process in Dhaka City. Bangladesh. Environ Earth Sci, 79, 268. https://doi.org/10.1007/s12665-020-09024-0
Bei, N., Xiao, B., Meng, N., & Feng, T. (2016). Critical role of meteorological conditions in a persistent haze episode in the Guanzhong basin, China. Science of the Total Environment, 550, 273–284. https://doi.org/10.1016/j.scitotenv.2015.12.159
Bera, A., Mukhopadhyay, B. P., Chowdhury, P., et al. (2021). Groundwater vulnerability assessment using GIS-based DRASTIC model in Nangasai River basin, India with special emphasis on agricultural contamination. Ecotoxicology and Environmental Safety, 214, 112085. https://doi.org/10.1016/j.ecoenv.2021.112085
Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://doi.org/10.1023/A:1010933404324
Chen, M. (1986). Regional characteristics and assessment of groundwater resource in China. Journal of Natural Resources, 1, 18–27. https://doi.org/10.11849/zrzyxb.1986.01.004
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. Association for Computing Machinery, New York, pp 785–794.
Chen, W., Panahi, M., Khosravi, K., et al. (2019a). Spatial prediction of groundwater potentiality using ANFIS ensembled with teaching-learning-based and biogeography-based optimization. Journal of Hydrology, 572, 435–448. https://doi.org/10.1016/j.jhydrol.2019.03.013
Chen, W., Tsangaratos, P., Ilia, I., et al. (2019b). Groundwater spring potential mapping using population-based evolutionary algorithms and data mining methods. Science of the Total Environment, 684, 31–49. https://doi.org/10.1016/j.scitotenv.2019.05.312
Cui, Y., & Shao, J. (2005). The role of ground water in arid/semiarid ecosystems, Northwest China. Groundwater, 43, 471–477. https://doi.org/10.1111/j.1745-6584.2005.0063.x
Díaz-Alcaide, S., & Martínez-Santos, P. (2019). Review: Advances in groundwater potential mapping. Hydrogeology Journal, 27, 2307–2324. https://doi.org/10.1007/s10040-019-02001-3
Doke, A. B., Zolekar, R. B., Patel, H., & Das, S. (2021). Geospatial mapping of groundwater potential zones using multi-criteria decision-making AHP approach in a hardrock basaltic terrain in India. Ecological Indicators, 127, 107685. https://doi.org/10.1016/j.ecolind.2021.107685
Duan, H., Deng, Z., Deng, F., & Wang, D. (2016). Assessment of groundwater potential based on multicriteria decision making model and decision tree algorithms. Mathematical Problems in Engineering, 2016, 2064575. https://doi.org/10.1155/2016/2064575
Farhat, B., Souissi, D., Mahfoudhi, R., et al. (2023). GIS-based multi-criteria decision-making techniques and analytical hierarchical process for delineation of groundwater potential. Environmental Monitoring and Assessment, 195, 285. https://doi.org/10.1007/s10661-022-10845-8
Fauvel, K., Fromont, E., Masson, V., et al. (2022). XEM: An explainable-by-design ensemble method for multivariate time series classification. Data Mining and Knowledge Discovery, 36, 917–957. https://doi.org/10.1007/s10618-022-00823-6
Fick, S. E., & Hijmans, R. J. (2017). WorldClim2: New 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology, 37, 4302–4315. https://doi.org/10.1002/joc.5086
Gislason, P. O., Benediktsson, J. A., & Sveinsson, J. R. (2006). Random forests for land cover classification. Pattern Recognition Letters, 27, 294–300. https://doi.org/10.1016/j.patrec.2005.08.011
Golkarian, A., Naghibi, S. A., Kalantar, B., & Pradhan, B. (2018). Groundwater potential mapping using C5.0, random forest, and multivariate adaptive regression spline models in GIS. Environmental Monitoring and Assessment, 190, 149. https://doi.org/10.1007/s10661-018-6507-8
Han, J., Wang, J., Chen, L., et al. (2021). Driving factors of desertification in Qaidam Basin, China: An 18-year analysis using the geographic detector model. Ecological Indicators, 124, 107404. https://doi.org/10.1016/j.ecolind.2021.107404
Han, K., Zuo, R., Ni, P., et al. (2020). Application of a genetic algorithm to groundwater pollution source identification. Journal of Hydrology, 589, 125343. https://doi.org/10.1016/j.jhydrol.2020.125343
Ibrahem Ahmed Osman, A., Najah Ahmed, A., Chow, M. F., et al. (2021). Extreme gradient boosting (Xgboost) model to predict the groundwater levels in Selangor Malaysia. Ain Shams Engineering Journal, 12, 1545–1556. https://doi.org/10.1016/j.asej.2020.11.011
Jansen, J. (2019). Drone based geophysical surveys for groundwater applications. In: 2019 groundwater week.
Jhariya, D. C., Khan, R., Mondal, K. C., et al. (2021). Assessment of groundwater potential zone using GIS-based multi-influencing factor (MIF), multi-criteria decision analysis (MCDA) and electrical resistivity survey techniques in Raipur city, Chhattisgarh, India. J Water Supply Res Technol-Aqua, 70, 375–400. https://doi.org/10.2166/aqua.2021.129
Jia, S., Zhu, W., Lű, A., & Yan, T. (2011). A statistical spatial downscaling algorithm of TRMM precipitation based on NDVI and DEM in the Qaidam basin of China. Remote Sensing of Environment, 115, 3069–3079. https://doi.org/10.1016/j.rse.2011.06.009
Jin, X., Guo, R., & Xia, W. (2013). Distribution of actual evapotranspiration over Qaidam basin, an arid area in China. Remote Sensing, 5, 6976–6996. https://doi.org/10.3390/rs5126976
Kaur, A., & Sood, S. K. (2020). Deep learning based drought assessment and prediction framework. Ecological Informatics, 57, 101067. https://doi.org/10.1016/j.ecoinf.2020.101067
Kong, F., Song, J., Zhang, Y., et al. (2019). Surface water-groundwater interaction in the Guanzhong section of the Weihe river basin, China. Groundwater, 57, 647–660. https://doi.org/10.1111/gwat.12854
Li, M., Sun, H., Singh, V. P., et al. (2019). Agricultural water resources management using maximum entropy and entropy-weight-based TOPSIS methods. Entropy, 21, 364. https://doi.org/10.3390/e21040364
Mo, S., Zabaras, N., Shi, X., & Wu, J. (2020). Integration of adversarial autoencoders with residual dense convolutional networks for estimation of non-Gaussian hydraulic conductivities. Water Resources Research, 56. https://doi.org/10.1029/2019WR026082
Naghibi, S. A., Pourghasemi, H. R., & Dixon, B. (2015a). GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran. Environmental Monitoring and Assessment, 188, 44. https://doi.org/10.1007/s10661-015-5049-6
Naghibi, S. A., Pourghasemi, H. R., Pourtaghi, Z. S., & Rezaei, A. (2015). Groundwater qanat potential mapping using frequency ratio and Shannon’s entropy models in the Moghan watershed Iran. Earth Science Informatics, 8, 171–186. https://doi.org/10.1007/s12145-014-0145-7
Panahi, M., Sadhasivam, N., Pourghasemi, H. R., et al. (2020). Spatial prediction of groundwater potential mapping based on convolutional neural network (CNN) and support vector regression (SVR). Journal of Hydrology, 588, 125033. https://doi.org/10.1016/j.jhydrol.2020.125033
Paul, A., Mukherjee, D. P., Das, P., et al. (2018). Improved random forest for classification. IEEE Transactions on Image Processing, 27, 4012–4024. https://doi.org/10.1109/TIP.2018.2834830
Pedregosa, F., Varoquaux, G., Gramfort, A., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
Pham, B. T., Jaafari, A., Phong, T. V., et al. (2021). Naïve Bayes ensemble models for groundwater potential mapping. Ecological Informatics, 64,
Rasool, U., Yin, X., Xu, Z., et al. (2022). Mapping of groundwater productivity potential with machine learning algorithms: A case study in the provincial capital of Baluchistan, Pakistan. Chemosphere, 303, 135265. https://doi.org/10.1016/j.chemosphere.2022.135265
Razandi, Y., Pourghasemi, H. R., Neisani, N. S., & Rahmati, O. (2015). Application of analytical hierarchy process, frequency ratio, and certainty factor models for groundwater potential mapping using GIS. Earth Science Informatics, 8, 867–883. https://doi.org/10.1007/s12145-015-0220-8
Reichstein, M., Camps-Valls, G., Stevens, B., et al. (2019). Deep learning and process understanding for data-driven Earth system science. Nature, 566, 195–204. https://doi.org/10.1038/s41586-019-0912-1
Ren, X., Li, P., He, X., et al. (2021). Hydrogeochemical processes affecting groundwater chemistry in the central part of the Guanzhong basin, China. Archives of Environmental Contamination and Toxicology, 80, 74–91. https://doi.org/10.1007/s00244-020-00772-5
Rizeei, H. M., Pradhan, B., Saharkhiz, M. A., & Lee, S. (2019). Groundwater aquifer potential modeling using an ensemble multi-adoptive boosting logistic regression technique. Journal of Hydrology, 579, 124172. https://doi.org/10.1016/j.jhydrol.2019.124172
Sexton, J., & Laake, P. (2009). Standard errors for bagged and random forest estimators. Computational Statistics & Data Analysis, 53, 801–811. https://doi.org/10.1016/j.csda.2008.08.007
Shamsudduha, M., & Taylor, R. G. (2020). Groundwater storage dynamics in the world’s large aquifer systems from GRACE: Uncertainty and role of extreme precipitation. Earth System Dynamics, 11, 755–774. https://doi.org/10.5194/esd-11-755-2020
Singh, S. K., Zeddies, M., Shankar, U., & Griffiths, G. A. (2019). Potential groundwater recharge zones within New Zealand. Geoscience Frontiers, 10, 1065–1072. https://doi.org/10.1016/j.gsf.2018.05.018
Sørensen, R., & Seibert, J. (2007). Effects of DEM resolution on the calculation of topographical indices: TWI and its components. Journal of Hydrology, 347, 79–89. https://doi.org/10.1016/j.jhydrol.2007.09.001
Sun, A. Y., Scanlon, B. R., Zhang, Z., et al. (2019). Combining physically based modeling and deep learning for fusing GRACE satellite data: Can we learn from mismatch? Water Resources Research, 55, 1179–1195. https://doi.org/10.1029/2018WR023333
Sun, X., Zhou, Y., Yuan, L., et al. (2021). Integrated decision-making model for groundwater potential evaluation in mining areas using the cusp catastrophe model and principal component analysis. Journal of Hydrology: Regional Studies, 37,
Tegegne, A. M. (2022). Applications of convolutional neural network for classification of land cover and groundwater potentiality zones. Journal of Engineering, 2022, 6372089. https://doi.org/10.1155/2022/6372089
Velis, M., Conti, K. I., & Biermann, F. (2017). Groundwater and human development: Synergies and trade-offs within the context of the sustainable development goals. Sustainability Science, 12, 1007–1017. https://doi.org/10.1007/s11625-017-0490-9
Wang, Y., Guo, H., Li, J., et al. (2008). Investigation and assessment of groundwater resources and their environmental issues in the Qaidam basin. Geology Press.
Wang, Z., Liu, Q., & Liu, Y. (2020). Mapping landslide susceptibility using machine learning algorithms and GIS: A case study in Shexian county, Anhui province, china. Symmetry-Basel, 12, 1954. https://doi.org/10.3390/sym12121954
Wang, Z., Wang, J., & Han, J. (2022). Spatial prediction of groundwater potential and driving factor analysis based on deep learning and geographical detector in an arid endorheic basin. Ecological Indicators, 142, 109256. https://doi.org/10.1016/j.ecolind.2022.109256
Xu, P., Zhang, Q., Qian, H., et al. (2019). Characterization of geothermal water in the piedmont region of Qinling mountains and Lantian-Bahe group in Guanzhong Basin China. Environmental Earth Sciences, 78, 442. https://doi.org/10.1007/s12665-019-8418-6
Zaree, M., Javadi, S., & Neshat, A. (2019). Potential detection of water resources in karst formations using APLIS model and modification with AHP and TOPSIS. Journal of Earth System Science, 128, 76. https://doi.org/10.1007/s12040-019-1119-4
Zhang, Q., Li, P., Lyu, Q., et al. (2022). Groundwater contamination risk assessment using a modified DRATICL model and pollution loading: A case study in the Guanzhong basin of China. Chemosphere, 291, 132695. https://doi.org/10.1016/j.chemosphere.2021.132695
Zhang, Y., Jia, R., Wu, J., et al. (2021). Evaluation of groundwater using an integrated approach of entropy weight and stochastic simulation: A case study in East Region of Beijing. International Journal of Environmental Research and Public Health, 18, 7703. https://doi.org/10.3390/ijerph18147703
Zhou, H., Gómez-Hernández, J. J., & Li, L. (2014). Inverse methods in hydrogeology: Evolution and recent trends. Advances in Water Resources, 63, 22–37. https://doi.org/10.1016/j.advwatres.2013.10.014
Funding
The study was supported cooperatively by Second Tibetan Plateau Scientific Expedition and Research Program (2019QZKK0805-02), Key deployment projects of the Chinese academy of sciences (ZDRW-ZS-2020–3), National Natural Science Foundation of China (U20A2088), Innovation Team Foundation of Qinghai Office of Science and Technology (2022-ZJ-903), and Kunlun Talented People of Qinghai Province, High-end Innovation and Entrepreneurship talents-Leading Talents (E140DZ3901).
Author information
Authors and Affiliations
Contributions
Zitao Wang contributed to the study’s methodology, performed formal analysis, drafted the original manuscript, and created visualizations. Jianping Wang conceptualized the study, reviewed and edited the manuscript. Dongmei Yu was responsible for data curation. Kai Chen contributed to the validation of the study’s findings. All authors reviewed and approved the final manuscript for submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Highlights
• The groundwater potential in the Guanzhong Basin was assessed using four ensemble learning models: Random Forest (RF), Extreme Gradient Boosting (XGB), and Local Cascade Ensemble (LCE).
• The XGB model was found to be the most accurate, with an AUC value of 0.874, followed by the RF model with an AUC of 0.859, and the LCE model with an AUC of 0.810.
• The XGB model required the least amount of computational resources and achieved the highest accuracy, making it the most practical option for predicting groundwater potential in the study area.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, Z., Wang, J., Yu, D. et al. Groundwater potential assessment using GIS-based ensemble learning models in Guanzhong Basin, China. Environ Monit Assess 195, 690 (2023). https://doi.org/10.1007/s10661-023-11388-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10661-023-11388-2