Abstract
Water pollution is a major global environmental issue. Predicting water quality parameters in advance is of utmost importance in the normal operation of society. However, existing empirical models exhibited low precision in water quality prediction due to the non-stationarity and non-linearity of the water quality series, and the performance of the long short-term memory network (LSTM) integrated with the improved meta-heuristic algorithm is still unclear and worth exploring. In this paper, we proposed a hybrid model based on the ensemble learning method that integrates the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and improved LSTM to model and forecast the variation of water quality parameters. First, the water quality series was denoised by an integrated filter based on CEEMDAN and fuzzy entropy (FE), and then, the ant lion algorithm based on chaos initialization, Cauchy mutation, and opposition-based learning (OBL) optimization (IALO) was used to determine the hyperparameters of LSTM. Weekly dissolved oxygen (DO) concentration data, from 1/2010-12/2016, collected at Yongjiang River and Beijiang River gauging stations in the Pearl River Basin were used to validate the effectiveness of the proposed model. The experimental results reveal that the proposed model has better predictive accuracy than other data-driven models because of better error performance, with average MAE, MAPE, and RMSE of 0.42, 5.37%, and 0.53 in the test stage, which is 50.24%, 47.66%, and 49.66% lower than the baseline LSTM model, respectively.
Similar content being viewed by others
Data availability
All data generated or analyzed during this study are included in this published article.
References
Abba SI, Pham QB, Saini G, Linh NTT et al (2020) Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index. Environ Sci Pollut Res 27(33):41524–41539. https://doi.org/10.1007/s11356-020-09689-x
Ahmed AN, Othman FB, Afan HA et al (2019) Machine learning methods for better water quality prediction. J Hydrol 578:124084. https://doi.org/10.1016/j.jhydrol.2019.124084
Aldhyani TH, Al-Yaari M, Alkahtani H, Maashi M (2020) Water quality prediction using artificial intelligence algorithms. Appl Bionics Biomech. https://doi.org/10.1155/2020/6659314
Banadkooki FB, Ehteram M, Ahmed AN et al (2020) Suspended sediment load prediction using artificial neural network and ant lion optimization algorithm. Environ Sci Pollut Res 27(30):38094–38116. https://doi.org/10.1007/s11356-020-09876-w
Bi J, Lin Y, Dong Q, Yuan H, Zhou M (2021) Large-scale water quality prediction with integrated deep neural network. Inf Sci 571:191–205. https://doi.org/10.1016/j.ins.2021.04.057
Chen XY, Chau KW (2019) Uncertainty analysis on hybrid double feedforward neural network model for sediment load estimation with LUBE method. Water Resour Manag 33(10):3563–3577. https://doi.org/10.1007/s11269-019-02318-4
Chen X, Feng F, Wu J, Liu W (2018) Anomaly detection for drinking water quality via deep biLSTM ensemble. In: Proceedings of the genetic and evolutionary computation conference companion (pp. 3–4). https://doi.org/10.1145/3205651.3208203
Chen Y, Song L, Liu Y, Yang L, Li D (2020) A review of the artificial neural network models for water quality prediction. Appl Sci 10(17):5776. https://doi.org/10.3390/app10175776
de Boves HP (2017) Support vector machine classification trees based on fuzzy entropy of classification. Anal Chim Acta 954:14–21. https://doi.org/10.1016/j.aca.2016.11.072
Deng W, Wang G, Zhang X (2015) A novel hybrid water quality time series prediction method based on cloud model and fuzzy forecasting. Chemom Intell Lab Syst 149:39–49. https://doi.org/10.1016/j.chemolab.2015.09.017
Dong Q, Lin Y, Bi J, Yuan H (2019) An integrated deep neural network approach for large-scale water quality time series prediction. In: 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC) (pp. 3537–3542). IEEE. https://doi.org/10.1109/SMC.2019.8914404
Fan S, Hao D, Feng Y, Xia K, Yang W (2021) A Hybrid model for air quality prediction based on data decomposition. Information 12(5):210. https://doi.org/10.3390/info12050210
Gorai AK, Hasni SA, Iqbal J (2016) Prediction of ground water quality index to assess suitability for drinking purposes using fuzzy rule-based approach. Appl Water Sci 6(4):393–405. https://doi.org/10.1007/s13201-014-0241-3
Heidari AA, Faris H, Mirjalili S, Aljarah I, Mafarja M (2020) Ant lion optimizer: theory, literature review, and application in multi-layer perceptron neural networks. Nat-Inspired Optim. https://doi.org/10.1007/978-3-030-12127-3_3
Hu Z, Zhang Y, Zhao Y, Xie M, Zhong J, Tu Z, Liu J (2019) A water quality prediction method based on the deep LSTM network considering correlation in smart mariculture. Sensors 19(6):1420. https://doi.org/10.3390/s19061420
Khadr M, Elshemy M (2017) Data-driven modeling for water quality prediction case study: the drains system associated with Manzala Lake, Egypt. Ain Shams Eng J 8(4):549–557. https://doi.org/10.1016/j.asej.2016.08.004
Kim SE, Seo IW (2015) Artificial neural network ensemble modeling with conjunctive data clustering for water quality prediction in rivers. J Hydro-Environ Res 9(3):325–339. https://doi.org/10.1016/j.jher.2014.09.006
Lan KT, Lan CH (2008) Notes on the distinction of Gaussian and Cauchy mutations. In: 2008 Eighth International Conference on Intelligent Systems Design and Applications (Vol. 1, pp. 272–277). IEEE. https://doi.org/10.1109/ISDA.2008.237
Lang X, Hu Z, Li P, Li Y, Cao J, Ren H (2018) Pipeline leak aperture recognition based on wavelet packet analysis and a deep belief network with ICR. Wirel Commun Mob Comput. https://doi.org/10.1155/2018/6934825
Li M, Wu W, Chen B, Guan L, Wu Y (2017a) Water quality evaluation using back propagation artificial neural network based on self-adaptive particle swarm optimization algorithm and chaos theory. Comput Water, Energy, Environ Eng 6(03):229. https://doi.org/10.4236/cweee.2017.63016
Li X, Cheng Z, Yu Q, Bai Y, Li C (2017b) Water-quality prediction using multimodal support vector regression: case study of Jialing River, China. J Environ Eng 143(10):04017070. https://doi.org/10.1061/(ASCE)EE.1943-7870.0001272
Li L, Jiang P, Xu H, Lin G, Guo D, Wu H (2019) Water quality prediction based on recurrent neural network and improved evidence theory: a case study of Qiantang River, China. Environ Sci Pollut Res 26(19):19879–19896. https://doi.org/10.1007/s11356-019-05116-y
Liu D, Cheng C, Fu Q et al (2018) Multifractal detrended fluctuation analysis of regional precipitation sequences based on the CEEMDAN-WPT. Pure Appl Geophys 175(8):3069–3084. https://doi.org/10.1007/s00024-018-1820-2
Liu J, Yu C, Hu Z, Zhao Y, Bai Y, Xie M, Luo J (2020) Accurate prediction scheme of water quality in smart mariculture with deep Bi-S-SRU learning network. IEEE Access 8:24784–24798. https://doi.org/10.1109/ACCESS.2020.2971253
Lu H, Ma X (2020) Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 249:126169. https://doi.org/10.1016/j.chemosphere.2020.126169
Malik A, Tikhamarine Y, Souag-Gamane D, Kisi O, Pham QB (2020) Support vector regression optimized by meta-heuristic algorithms for daily streamflow prediction. Stoch Env Res Risk Assess 34(11):1755–1773. https://doi.org/10.1007/s00477-020-01874-1
Mani M, Bozorg-Haddad O, Chu X (2018) Ant lion optimizer (ALO) algorithm. Advanced optimization by nature-inspired algorithms. Springer, Singapore, pp 105–116
Mao Q, Zhang Q (2021) Improved sparrow algorithm combining Cauchy mutation and opposition-based learning. J Front Comp Sci Technol. https://doi.org/10.3778/j.issn.1673-9418.2010032
Meng E, Huang S, Huang Q et al (2019) A robust method for non-stationary streamflow prediction based on improved EMD-SVM model. J Hydrol 568:462–478. https://doi.org/10.1016/j.jhydrol.2018.11.015
Mirjalili S (2015) The ant lion optimizer. Adv Eng Softw 83:80–98. https://doi.org/10.1016/j.advengsoft.2015.01.010
Najah A, El-Shafie A, Karim OA, El-Shafie AH (2014) Performance of ANFIS versus MLP-NN dissolved oxygen prediction models in water quality monitoring. Environ Sci Pollut Res 21(3):1658–1670. https://doi.org/10.1007/s11356-013-2048-4
Nash JE, Sutcliffe JV (1970) River flow forecasting through conceptual models part I—a discussion of principles. J Hydrol 10(3):282–290. https://doi.org/10.1016/0022-1694(70)90255-6
Rahnamayan S, Tizhoosh HR, Salama MM (2008) Opposition-based differential evolution. IEEE Trans Evol Comput 12(1):64–79. https://doi.org/10.1109/TEVC.2007.894200
Rahnamayan S, Wang GG, Ventresca M (2012) An intuitive distance-based explanation of opposition-based sampling. Appl Soft Comput 12(9):2828–2839. https://doi.org/10.1016/j.asoc.2012.03.034
Rajaee T, Jafari H (2018) Utilization of WGEP and WDT models by wavelet denoising to predict water quality parameters in rivers. J Hydrol Eng 23(12):04018054. https://doi.org/10.1061/(ASCE)HE.1943-5584.0001700
Rajaee T, Khani S, Ravansalar M (2020) Artificial intelligence-based single and hybrid models for prediction of water quality in rivers: a review. Chemom Intell Lab Syst 200:103978. https://doi.org/10.1016/j.chemolab.2020.103978
Singh P (2018) Indian summer monsoon rainfall (ISMR) forecasting using time series data: a fuzzy-entropy-neuro based expert system. Geosci Front 9(4):1243–1257. https://doi.org/10.1016/j.gsf.2017.07.011
Song C, Yao L (2022) Application of artificial intelligence based on synchrosqueezed wavelet transform and improved deep extreme learning machine in water quality prediction. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-022-18757-3
Song C, Yao L, Hua C, Ni Q (2021a) A novel hybrid model for water quality prediction based on synchrosqueezed wavelet transform technique and improved long short-term memory. J Hydrol 603:126879. https://doi.org/10.1016/j.jhydrol.2021.126879
Song C, Yao L, Hua C, Ni Q (2021b) A water quality prediction model based on variational mode decomposition and the least squares support vector machine optimized by the sparrow search algorithm (VMD-SSA-LSSVM) of the Yangtze River, China. Environ Monit Assess 193(6):1–17. https://doi.org/10.1007/s10661-021-09127-6
Sun L, Chen S, Xu J, Tian Y (2019) Improved monarch butterfly optimization algorithm based on opposition-based learning and random local perturbation. Complexity. https://doi.org/10.1155/2019/4182148
Tizhoosh HR (2005) Opposition-based learning: a new scheme for machine intelligence. In: International conference on computational intelligence for modelling, control and automation and international conference on intelligent agents, web technologies and internet commerce (CIMCA-IAWTIC'06) (Vol. 1, pp. 695–701). IEEE. https://doi.org/10.1109/CIMCA.2005.1631345
Torres ME, Colomina MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 4144–4147). IEEE. https://doi.org/10.1109/ICASSP.2011.5947265
Wang Y, Zhou J, Chen K, Wang Y, Liu L (2017) Water quality prediction method based on LSTM neural network. In: 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE) (pp. 1–5). IEEE. https://doi.org/10.1109/ISKE.2017.8258814
Wang Y, Zhou X, Engel B (2018) Water environment carrying capacity in Bosten Lake basin. J Clean Prod 199:574–583. https://doi.org/10.1016/j.jclepro.2018.07.202
Wang J, Zhang L, Zhang W, Wang X (2019) Reliable model of reservoir water quality prediction based on improved ARIMA method. Environ Eng Sci 36(9):1041–1048. https://doi.org/10.1089/ees.2018.0279
Wang J, Xu Y, She C, Xu P, Bagal HA (2022) Optimal parameter identification of SOFC model using modified gray wolf optimization algorithm. Energy 240:122800. https://doi.org/10.1016/j.energy.2021.122800
Xu R, Xiong Q, Yi H, Wu C, Ye J (2019) Research on water quality prediction based on SARIMA-LSTM: a case study of Beilun Estuary. In: 2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS) (pp. 2183–2188). IEEE. https://doi.org/10.1109/HPCC/SmartCity/DSS.2019.00302
Yang HEJ (2009) An adaptive chaos immune optimization algorithm with mutative scale and its application. Control Theory Appl 6(10):1069–1074
Yang W, Xia K, Fan S, Wang L, Li T, Zhang J, Feng Y (2022) A multi-strategy whale optimization algorithm and its application. Eng Appl Artif Intell 108:104558. https://doi.org/10.1016/j.engappai.2021.104558
Yu Y, Zhang H, Singh VP (2018) Forward prediction of runoff data in data-scarce basins with an improved ensemble empirical mode decomposition (EEMD) model. Water 10(4):388. https://doi.org/10.3390/w10040388
Zhai X, Xia J, Zhang Y (2014) Water quality variation in the highly disturbed Huai River Basin, China from 1994 to 2005 by multi-statistical analyses. Sci Total Environ 496:594–606. https://doi.org/10.1016/j.scitotenv.2014.06.101
Zhang Y, Wen X, Jiang L, Liu J, Yang J, Liu S (2020) Prediction of high-quality reservoirs using the reservoir fluid mobility attribute computed from seismic data. J Petrol Sci Eng 190:107007. https://doi.org/10.1016/j.petrol.2020.107007
Zhou Y (2020) Real-time probabilistic forecasting of river water quality under data missing situation: deep learning plus post-processing techniques. J Hydrol 589:125164. https://doi.org/10.1016/j.jhydrol.2020.125164
Zhou J, Wang Y, Xiao F, Wang Y, Sun L (2018) Water quality prediction method based on IGRA and LSTM. Water 10(9):1148. https://doi.org/10.3390/w10091148
Zou Q, Xiong Q, Li Q, Yi H, Yu Y, Wu C (2020) A water quality prediction method based on the multi-time scale bidirectional long short-term memory network. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-020-08087-7
Zubaidi SL, Al-Bugharbee H, Ortega-Martorell S et al (2020) A novel methodology for prediction urban water demand by wavelet denoising and adaptive neuro-fuzzy inference system approach. Water 12(6):1628. https://doi.org/10.3390/w12061628
Acknowledgements
The authors would like to thank the editors and anonymous reviewers for their constructive comments that greatly contributed to improving the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Song, C., Yao, L. A hybrid model for water quality parameter prediction based on CEEMDAN-IALO-LSTM ensemble learning. Environ Earth Sci 81, 262 (2022). https://doi.org/10.1007/s12665-022-10380-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12665-022-10380-2