Water Quality Classification and Machine Learning Model for Predicting Water Quality Status - A Study on Loa River Located in an Extremely Arid Environment: Atacama Desert

Flores, Victor; Bravo, Ingrid; Saavedra, Marcelo

Abstract

Water is the most important resource for human, animal, and plant life. Recently, artificial intelligence techniques such as Random Forest have been combined with other techniques, such as logical-mathematical reasoning models, to generate predictive water quality models. This study describes a rule-based inference technique for generating water quality labels from historical physicochemical parameter data collected by the Chilean Ministry of the Environment at seven water monitoring stations on the Loa River. A predictive model of water quality status was then built using Random Forest, physicochemical parameters, and expert knowledge. The Random Forest results are validated using three machine learning quality indicators: accuracy (acc), precision (p), and recall (r). This paper describes dataset preparation, the refinement of the threshold values for the physicochemical parameters most significant to the class, and the predictive model that labels water quality. The models obtained yielded the following mean values: acc = 0.897, p = 89.73%, and r = 0.928. The ML model reported here is novel, since no previous studies of this kind predict the water quality of the Loa River, which is located in an extremely arid zone. This study also helps to create specific knowledge for predicting freshwater quality.
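As a rough illustration of two steps named in the abstract, rule-based labeling and validation with accuracy, precision, and recall, the following minimal Python sketch may help. It is not the authors' implementation: the parameter names, the threshold values, the two-class labels, and the sample readings are all hypothetical, and the Random Forest step is not reproduced (the predicted labels are hand-written stand-ins for a classifier's output).

```python
# Hypothetical thresholds, chosen only for illustration; the study's
# refined threshold values are not given in this record.
THRESHOLDS = {
    "arsenic_mg_l": 0.05,
    "conductivity_us_cm": 3000.0,
}


def label_sample(sample):
    """Rule-based label: 'poor' if any parameter exceeds its threshold."""
    for name, limit in THRESHOLDS.items():
        if sample[name] > limit:
            return "poor"
    return "good"


def evaluate(y_true, y_pred, positive="poor"):
    """Return (accuracy, precision, recall) for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)
    accuracy = correct / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Made-up readings, not data from the Loa River stations.
samples = [
    {"arsenic_mg_l": 0.01, "conductivity_us_cm": 1500.0},
    {"arsenic_mg_l": 0.08, "conductivity_us_cm": 2500.0},
    {"arsenic_mg_l": 0.02, "conductivity_us_cm": 4200.0},
    {"arsenic_mg_l": 0.03, "conductivity_us_cm": 2900.0},
]

# Labels produced by the rules serve as the reference classes.
labels = [label_sample(s) for s in samples]

# Stand-in for the output of a trained classifier (hypothetical).
predicted = ["good", "poor", "poor", "poor"]

acc, p, r = evaluate(labels, predicted)
print(f"acc = {acc:.3f}, p = {p:.3f}, r = {r:.3f}")
```

In this sketch all three indicators are fractions between 0 and 1; multiplying by 100 gives the percentage form.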

More information

Title according to WOS: ID WOS:001056774500001 Not found in local WOS DB
Journal title: Water
Volume: 15
Issue: 16
Publisher: MDPI
Publication date: 2023
DOI: 10.3390/w15162868

Notes: ISI