Performance Analysis of Random Forest with Sampling for River Water Quality Classification

Rahmi Fadhilah*, Heri Kuswanto, Dedy Dwi Prastyo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The availability of quality water is essential for the survival of organisms on Earth. However, monitoring and managing the environment surrounding the water source is crucial to ensure water is safe for consumption. While classification methods are commonly used to assess water quality, challenges arise when data is imbalanced between classes. Data resampling techniques, including individual and combination, are often used to address this imbalance. This study aims to evaluate the performance of the Random Forest algorithm in classifying river water quality after resampling RACOG, RUS, and RACOG-RUS data, both with and without feature selection. Data derived from the Provincial Environment Office was evaluated using various performance metrics. Our findings show that the Random Forest model, especially with individual RACOG resampling, exhibits the most promising performance, while the RUS model shows less than optimal performance. Interestingly, the combined RACOG-RUS approach effectively addresses the class imbalance problem. Moreover, feature selection contributes to improved performance in both the RUS and RACOG-RUS models, with the RACOG model showing consistent performance. Notably, the RACOG-RUS model consistently excels in prediction accuracy and stability. This research confirms the adoption of a combination approach to address river water quality classification data imbalance.

Original languageEnglish
Title of host publication2024 7th International Conference on Informatics and Computational Sciences, ICICoS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages456-461
Number of pages6
ISBN (Electronic)9798350375886
DOIs
Publication statusPublished - 2024
Event7th International Conference on Informatics and Computational Sciences, ICICoS 2024 - Hybrid, Semarang, Indonesia
Duration: 17 Jul 202418 Jul 2024

Publication series

NameProceedings - International Conference on Informatics and Computational Sciences
ISSN (Print)2767-7087

Conference

Conference7th International Conference on Informatics and Computational Sciences, ICICoS 2024
Country/TerritoryIndonesia
CityHybrid, Semarang
Period17/07/2418/07/24

Keywords

  • Classification
  • Imbalance
  • RACOG
  • RACOG-RUS
  • RUS
  • Random Forest

Fingerprint

Dive into the research topics of 'Performance Analysis of Random Forest with Sampling for River Water Quality Classification'. Together they form a unique fingerprint.

Cite this