TY - JOUR
T1 - Comparison of Data Mining Techniques on Stroke Clinical Dataset
AU - Prasetyo, Viko Pradana
AU - Ulin Nuha, Muhammad Fajrul Alam
AU - Hakiki, Makhi Hakim
AU - Vinarti, Retno Aulia
AU - Djunaidy, Arif
N1 - Publisher Copyright:
© 2023 The Authors. Published by Elsevier B.V.
PY - 2024
Y1 - 2024
N2 - Stroke is a significant cause of mortality and morbidity worldwide, making it essential to identify individuals at risk of experiencing a stroke. The aim of this research article is to develop a predictive model to determine the stroke risk of individuals based on their medical history and compare the effectiveness of preprocessing techniques on the model's performance. The methodology involves two streams of analysis - with and without data preprocessing - utilizing classification models to predict stroke risk (K-Nearest Neighbor, Decision Tree and Support Vector Machine). The results indicate that data preprocessing improves the performance of all models, with KNN and SVM showing high precision and recall values, making them effective models for predicting strokes. Conversely, the decision tree model performs well with data preprocessing despite slightly lower accuracy and recall values. These findings suggest that preprocessing is a crucial stage in machine learning and can enhance the performance of classification models in predicting stroke risk.
AB - Stroke is a significant cause of mortality and morbidity worldwide, making it essential to identify individuals at risk of experiencing a stroke. The aim of this research article is to develop a predictive model to determine the stroke risk of individuals based on their medical history and compare the effectiveness of preprocessing techniques on the model's performance. The methodology involves two streams of analysis - with and without data preprocessing - utilizing classification models to predict stroke risk (K-Nearest Neighbor, Decision Tree and Support Vector Machine). The results indicate that data preprocessing improves the performance of all models, with KNN and SVM showing high precision and recall values, making them effective models for predicting strokes. Conversely, the decision tree model performs well with data preprocessing despite slightly lower accuracy and recall values. These findings suggest that preprocessing is a crucial stage in machine learning and can enhance the performance of classification models in predicting stroke risk.
KW - Classification
KW - Data Mining Techniques
KW - Decision Tree
KW - Health
KW - KNN
KW - SVM
KW - Stroke
UR - http://www.scopus.com/inward/record.url?scp=85193202511&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2024.03.033
DO - 10.1016/j.procs.2024.03.033
M3 - Conference article
AN - SCOPUS:85193202511
SN - 1877-0509
VL - 234
SP - 502
EP - 511
JO - Procedia Computer Science
JF - Procedia Computer Science
T2 - 7th Information Systems International Conference, ISICO 2023
Y2 - 26 July 2023 through 28 July 2023
ER -