Water quality classification using machine learning algorithms

Bibliographic Details
Published in: Journal of Water Process Engineering, Vol. 48, p. 102920
Main Authors: Nasir, Nida; Kansal, Afreen; Alshaltone, Omar; Barneih, Feras; Sameer, Mustafa; Shanableh, Abdallah; Al-Shamma'a, Ahmed
Format: Journal Article
Language: English
Published: Elsevier Ltd 01.08.2022
ISSN: 2214-7144
Description
Summary: Monitoring water quality is essential for protecting human health and the environment. Artificial Intelligence (AI) offers significant opportunities to improve the classification and prediction of water quality (WQ). In this study, various AI algorithms are assessed on WQ data collected over an extended period, with the aim of developing a dependable approach for forecasting water quality as accurately as possible. Specifically, several machine learning classifiers and their stacking ensemble models were used to classify the WQ data via the Water Quality Index (WQI). The classifiers studied were Support Vector Machine (SVM), Random Forest (RF), Logistic Regression (LR), Decision Tree (DT), CATBoost, XGBoost, and Multilayer Perceptron (MLP). The dataset comprised 1679 samples and their metadata, collected over nine years. Precision-recall curves and Receiver Operating Characteristic (ROC) curves were used to assess the performance of the classifiers. The findings revealed that CATBoost was the most accurate individual classifier, with an accuracy of 94.51%. Moreover, after applying stacking ensemble models with all classifiers, accuracy reached 100% with various meta-classifiers. CATBoost achieved the highest accuracy both as a primary gradient boosting algorithm and as a meta-classifier. Therefore, the boosting algorithm is proposed as a reliable approach for WQ classification. The analysis presented in this article provides a framework that can support researchers working toward water quality improvement using artificial intelligence.

Highlights:
• The Water Quality Index (WQI) turns complex water quality data into understandable information.
• Seven individual classifiers were developed to predict the WQI.
• Stacked modelling proved successful in predicting water quality.
• The CATBoost approach obtained the best predictive results.
DOI: 10.1016/j.jwpe.2022.102920
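
The summary says that samples are classified via the Water Quality Index, but this record does not give the index formula, the measured parameters, or the class boundaries. The sketch below illustrates one common form, a weighted arithmetic mean of per-parameter sub-index scores followed by threshold-based labelling. The parameter names, weights, and cutoffs are all hypothetical placeholders chosen for illustration, not values from the article.

```python
from typing import Dict, List, Tuple

# Hypothetical relative weights per parameter (NOT taken from the article).
WEIGHTS: Dict[str, float] = {
    "ph": 4.0,
    "dissolved_oxygen": 5.0,
    "turbidity": 3.0,
    "nitrate": 2.0,
}

# Hypothetical class cutoffs on a 0-100 scale where higher means better quality.
# Listed from the highest cutoff to the lowest.
CLASSES: List[Tuple[float, str]] = [
    (90.0, "excellent"),
    (70.0, "good"),
    (50.0, "fair"),
    (0.0, "poor"),
]


def water_quality_index(sub_indices: Dict[str, float]) -> float:
    """Return the weighted arithmetic mean of sub-index scores (each 0-100)."""
    if not sub_indices:
        raise ValueError("at least one sub-index score is required")
    total_weight = sum(WEIGHTS[name] for name in sub_indices)
    weighted_sum = sum(WEIGHTS[name] * score for name, score in sub_indices.items())
    return weighted_sum / total_weight


def classify(wqi: float) -> str:
    """Map a WQI value to the first class whose cutoff it meets."""
    for cutoff, label in CLASSES:
        if wqi >= cutoff:
            return label
    return CLASSES[-1][1]  # values below every cutoff fall into the lowest class


# One made-up sample: sub-index scores already scaled to 0-100.
sample = {"ph": 95.0, "dissolved_oxygen": 80.0, "turbidity": 60.0, "nitrate": 70.0}
wqi = water_quality_index(sample)
label = classify(wqi)
print(f"WQI = {wqi:.2f}, class = {label}")
```

Labels produced this way would serve as the target variable for classifiers such as those listed in the summary; how the study actually derived its labels should be checked against the full text.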