Application of classification machine learning algorithms for characterizing nutrient transport in a clay plain agricultural watershed
Excess nutrients in surface water and groundwater can lead to water quality deterioration in available water resources. Thus, the classification of nutrient concentrations in water resources has gained significant attention during recent decades. Machine learning (ML) algorithms are considered an ef...
Uloženo v:
| Vydáno v: | Journal of environmental management Ročník 345; s. 118924 |
|---|---|
| Hlavní autoři: | , , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Elsevier Ltd
01.11.2023
|
| Témata: | |
| ISSN: | 0301-4797, 1095-8630, 1095-8630 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | Excess nutrients in surface water and groundwater can lead to water quality deterioration in available water resources. Thus, the classification of nutrient concentrations in water resources has gained significant attention during recent decades. Machine learning (ML) algorithms are considered an efficient tool to describe nutrient loss from agricultural land to surface water and groundwater. Previous studies have applied regression and classification ML algorithms to predict nutrient concentrations in surface water and/or groundwater, or to categorize an output variable using a limited number of input variables. However, there have been no studies that examined the application of different ML classification algorithms in agricultural settings to classify various output variables using a wide range of input variables. In this study, twenty-four ML classification algorithms were implemented on a dataset from three locations within the Upper Parkhill watershed, an agricultural watershed in southern Ontario, Canada. Nutrient concentrations in surface water were classified using geochemical and physical water parameters of surface water and groundwater (e.g., pH), climate and field conditions as the input variables. The performance of these algorithms was evaluated using four evaluation metrics (e.g., classification accuracy) to identify the optimal algorithm for classifying the output variables. Ensemble bagged trees was found to be the optimal ML algorithm for classifying nitrate concentration in surface water (accuracy of 90.9%), while the weighted KNN was the most appropriate algorithm for categorizing the total phosphorus concentration (accuracy of 87%). The ensemble subspace discriminant algorithm gave the highest overall classification accuracy for the concentration of soluble reactive phosphorus and total dissolved phosphorus in surface water with an accuracy of 79.2% and 77.9%, respectively. This study exemplifies that ML algorithms can be used to signify exceedance of recommended concentrations of nutrients in surface waters in agricultural watersheds. Results are useful for decision makers to develop nutrient management strategies.
[Display omitted]
•Machine learning algorithms were applied on a data from an agricultural watershed.•Output variables were nutrient concentrations in surface water in the watershed.•Performance of algorithms was assessed using four evaluation metrics.•Interdependence between different nutrient transport variables was investigated.•Machine learning results can be used for nutrient management and decision making. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0301-4797 1095-8630 1095-8630 |
| DOI: | 10.1016/j.jenvman.2023.118924 |