Skip to main content

Advertisement

Log in

An unsupervised water quality anomaly detection method based on a combination of time-frequency analysis and clustering

  • Research Article
  • Published:
Environmental Science and Pollution Research Aims and scope Submit manuscript

Abstract

Anomalies in water quality, which frequently arise due to pollution, constitute a substantial menace to human health. The preservation of public welfare critically entails the timely recognition of abnormal water quality. Conventional techniques for detecting water quality anomalies face obstacles such as the necessity of expert knowledge, limited accuracy in detection, and delays in identification. In this paper, we proposed an original unsupervised technique for identifying water quality anomalies combined with time-frequency analysis and clustering (TCAD). We chose time-frequency analysis because it effectively evaluates water quality changes, generating distinct multi-band signals that reflect different aspects of water quality dynamics. We also proposed a clustering technique which can identify water quality markers and amalgamate data from multi-band signals for accurate anomaly detection. We seek to clarify the reasoning behind our methodology by portraying how time-frequency analysis and clustering address the deficiencies of conventional methods. Our experiments evaluated various indicators of water quality, and the effectiveness of our proposed approach was supported by comparative analyses with commonly used models for detecting anomalies in water quality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Availability of data and materials

All data generated or analyzed during this study are included in this published article.

References

  • Akbari Z, Unland R (2016) Automated determination of the input parameter of DBSCAN based on outlier detection. In: Artificial Intelligence Applications and Innovations: 12th IFIP WG 12.5 International Conference and Workshops, AIAI 2016, Thessaloniki, Greece, September 16-18, 2016, Proceedings 12, Springer, pp 280–291

  • An P, Wang Z, Zhang C (2022) Ensemble unsupervised autoencoders and gaussian mixture model for cyberattack detection. Inf Process Manag 59(2):102844

    Article  Google Scholar 

  • Belhadi A, Djenouri Y, Srivastava G et al (2020) A two-phase anomaly detection model for secure intelligent transportation ride-hailing trajectories. IEEE Trans Intell Transp Syst 22(7):4496–4506

    Article  Google Scholar 

  • Dogo EM, Nwulu NI, Twala B et al (2019) A survey of machine learning methods applied to anomaly detection on drinking-water quality data. Urban Water J 16(3):235–248

    Article  Google Scholar 

  • Erfani SM, Rajasegarar S, Karunasekera S et al (2016) High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning. Pattern Recognit 58:121–134

    Article  ADS  Google Scholar 

  • Foroumandi E, Nourani V, Dabrowska D et al (2022) Linking spatial-temporal changes of vegetation cover with hydroclimatological variables in terrestrial environments with a focus on the lake Urmia basin. Land 11(1):115

    Article  Google Scholar 

  • Foroumandi E, Nourani V, Kantoush SA (2022) Investigating the main reasons for the tragedy of large saline lakes: drought, climate change, or anthropogenic activities? A call to action. Journal of Arid Environ 196(104):652

    Google Scholar 

  • Foroumandi E, Nourani V, Huang JJ et al (2023) Drought monitoring by downscaling grace-derived terrestrial water storage anomalies: a deep learning approach. Journal of Hydrology 616

  • He HM, Hou DB, Zhao HF et al (2013) Multi-parameters fusion algorithm for detecting anomalous water quality. J Zhejiang Univ Eng Sci 47(4):735–740

    Google Scholar 

  • Hilal W, Gadsden SA, Yawney J (2021) A review of anomaly detection techniques and applications in financial fraud. Expert Syst Appl 116429

  • Huang P, Wang L, Hou D et al (2021) A feature extraction method based on the entropy-minimal description length principle and GBDT for common surface water pollution identification. J Hydroinformatics 23(5):1050–1065

    Article  Google Scholar 

  • Jalal D, Ezzedine T (2020) Decision tree and support vector machine for anomaly detection in water distribution networks. In: 2020 International Wireless Communications and Mobile Computing (IWCMC), IEEE, pp 1320–1323

  • Jiang X, Song Q, Du G et al (2023) A review on research and application of variational mode decomposition. J Instrum

  • Lesouple J, Baudoin C, Spigai M et al (2021) Generalized isolation forest for anomaly detection. Pattern Recognit Lett 149:109–119

    Article  ADS  Google Scholar 

  • Liu J, Wang P, Jiang D et al (2020) An integrated data-driven framework for surface water quality anomaly detection and early warning. J Clean Prod 251:119145

    Article  CAS  Google Scholar 

  • Lung WS (2022) River bod/do modeling at new normal. In: Water Quality Modeling That Works. Springer, p 69–105

  • McKenna SA, Klise KA, Wilson MP (2008) Testing water quality change detection algorithms. Water Distribution Systems Analysis Symposium 2006:1–15

    Google Scholar 

  • Mendivil-García K, Amabilis-Sosa LE, Salinas-Juárez MG et al (2022) Climate change impact assessment on a tropical river resilience using the Streeter-Phelps dissolved oxygen model. Front Environ Sci 10

  • Muharemi F, Logofătu D, Leon F (2019) Machine learning approaches for anomaly detection of water quality on a real-world data set. J Inf Telecommun 3(3):294–307

    Google Scholar 

  • Narayan A, Madhu Kumar S, Chacko AM (2023) A review of financial fraud detection in e-commerce using machine learning. In: Intelligent Data Engineering and Analytics: Proceedings of the 10th International Conference on Frontiers in Intelligent Computing: Theory and Applications (FICTA 2022), Springer, pp 237–248

  • Ni Q, Cao X, Tan C et al (2022) An improved graph convolutional network with feature and temporal attention for multivariate water quality prediction. Environ Sci Pollut Res 1–14

  • Nourani V, Ghaneei P, Kantoush SA (2022) Robust clustering for assessing the spatiotemporal variability of groundwater quantity and quality. J Hydrol 604:127272

    Article  CAS  Google Scholar 

  • Oliker N, Ostfeld A (2014) Comparison of multivariate classification methods for contamination event detection in water distribution systems. Procedia Eng 70:1271–1279

    Article  Google Scholar 

  • Pang G, Shen C, Cao L et al (2021) Deep learning for anomaly detection: a review. ACM computing surveys (CSUR) 54(2):1–38

  • Peterson KT, Sagan V, Sloan JJ (2020) Deep learning-based water quality estimation and anomaly detection using Landsat-8/Sentinel-2 virtual constellation and cloud computing. GIScience Remote Sens 57(4):510–525

    Article  Google Scholar 

  • Qin w, Fu y (2018) Water quality abnormity inspection and detection via the vector auto-regressive model. J Saf Environ 18(04):1560–1563

  • Raciti M, Cucurull J, Nadjm-Tehrani S (2012) Anomaly detection in water management systems. In: Critical infrastructure protection. Springer, p 98–119

  • Rodriguez-Perez J, Leigh C, Liquet B et al (2020) Detecting technical anomalies in high-frequency water-quality data using artificial neural networks. Environ Sci Technol 54(21):13719–13730

    Article  CAS  PubMed  ADS  Google Scholar 

  • Shaukat K, Alam TM, Luo S et al (2021) A review of time-series anomaly detection techniques: a step to future perspectives. In: Advances in Information and Communication: Proceedings of the 2021 Future of Information and Communication Conference (FICC), Volume 1, Springer, pp 865–877

  • Zhang J, Zhu X, Yue Y et al (2017) A real-time anomaly detection algorithm/or water quality data using dual time-moving windows. In: 2017 Seventh international conference on innovative computing technology (INTECH), IEEE, pp 36–41

  • Zhi X, Jing M, Hao W et al (2019) Key indicator and critical condition for the water resource carrying capacity in the Yangtze river estuary. Journal of Tsinghua University (Science and Technology) 59(5):364–372

    Google Scholar 

  • Zhong S, Zhang K, Bagheri M et al (2021) Machine learning: new ideas and tools in environmental science and engineering. Environ Sci Technol 55(19):12741–12754

    CAS  PubMed  Google Scholar 

Download references

Funding

This paper is supported by the National Natural Science Foundation of China (12273003).

Author information

Authors and Affiliations

Authors

Contributions

The study conception and design, data collection and analysis, guide of experiments, and paper revision were performed by Qingjian Ni. The design of model, code implementation, experiments, the first draft of the manuscript, and paper revision were performed by Xuehan Cao. Jiayi Yuan and Chaoqun Tan commented on previous versions of the manuscript and made suggestions for revision. Ziqi Zhao was responsible for creating specific diagrams and making various revisions to this paper.

Corresponding author

Correspondence to Qingjian Ni.

Ethics declarations

Ethical approval and consent to participate

All authors have read and agreed to the published version of the manuscript.

Consent for publication

All authors have read and agreed to the published version of the manuscript.

Conflict of interest

The authors declare no competing interests.

Additional information

Responsible editor: Xianliang Yi.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ni, Q., Cao, X., Zhao, Z. et al. An unsupervised water quality anomaly detection method based on a combination of time-frequency analysis and clustering. Environ Sci Pollut Res 31, 15920–15931 (2024). https://doi.org/10.1007/s11356-024-32170-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11356-024-32170-y

Keywords

Navigation