TY - GEN
T1 - Implementation of Synthetic Minority Over-Sampling Technique in the Anaemia Classification Using the LSTM and Bi-LSTM Algorithms
AU - Pamungkas, Yuri
AU - Indriani, Ratri Dwi
AU - Syulthoni, Zain Budi
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Anaemiais a health disorder characterized by a lack of red blood cells in a person's body. Anaemia sufferers will tire more quickly, and their faces look paler than normal people's. Anaemia can occur over a short or long period (depending on the severity). If anaemia continues to be ignored, complications can worsen a person's health condition. Considering the impact caused by anaemia, a solution is needed to detect anaemia precisely and accurately. One way can be taken is by utilizing an artificial intelligence-based system to detect or classify anaemia based on its symptoms. Therefore, we tried to analyze factors related to anaemia in this study and carry out anaemia classification based on the LSTM and Bi-LSTM algorithms. The dataset used in this study came from the Kaggle repository, which contains medical record information for 1421 patients (620 patients diagnosed with anaemia and 801 patients with non-anaemia). The patient's medical record information includes gender, blood haemoglobin level, MCH, MCHC, MCV, and diagnosis results. In the research dataset, we also applied SMOTE to balance the data classes for anaemia and non-anaemia sufferers and compare the classification results' performance. Based on the research results, the haemoglobin level factor has the highest correlation value of 0.8 compared to other factors such as gender (0.25), MCHC (0.05), MCH (0.03), and MCV (0.02). Meanwhile, the classification results show that the use of SMOTE can increase the specificity (100%), precision (100%), F1-score (98.93%), and accuracy (98.86%) of the LSTM algorithm during the classification process.
AB - Anaemiais a health disorder characterized by a lack of red blood cells in a person's body. Anaemia sufferers will tire more quickly, and their faces look paler than normal people's. Anaemia can occur over a short or long period (depending on the severity). If anaemia continues to be ignored, complications can worsen a person's health condition. Considering the impact caused by anaemia, a solution is needed to detect anaemia precisely and accurately. One way can be taken is by utilizing an artificial intelligence-based system to detect or classify anaemia based on its symptoms. Therefore, we tried to analyze factors related to anaemia in this study and carry out anaemia classification based on the LSTM and Bi-LSTM algorithms. The dataset used in this study came from the Kaggle repository, which contains medical record information for 1421 patients (620 patients diagnosed with anaemia and 801 patients with non-anaemia). The patient's medical record information includes gender, blood haemoglobin level, MCH, MCHC, MCV, and diagnosis results. In the research dataset, we also applied SMOTE to balance the data classes for anaemia and non-anaemia sufferers and compare the classification results' performance. Based on the research results, the haemoglobin level factor has the highest correlation value of 0.8 compared to other factors such as gender (0.25), MCHC (0.05), MCH (0.03), and MCV (0.02). Meanwhile, the classification results show that the use of SMOTE can increase the specificity (100%), precision (100%), F1-score (98.93%), and accuracy (98.86%) of the LSTM algorithm during the classification process.
KW - Anaemia Classification
KW - Bidirectional LSTM
KW - Correlation Analysis
KW - Long-Short Term Memory
KW - SMOTE
UR - http://www.scopus.com/inward/record.url?scp=85214685847&partnerID=8YFLogxK
U2 - 10.1109/EECSI63442.2024.10776106
DO - 10.1109/EECSI63442.2024.10776106
M3 - Conference contribution
AN - SCOPUS:85214685847
T3 - International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)
SP - 359
EP - 364
BT - Proceedings - 2024 11th International Conference on Electrical Engineering, Computer Science and Informatics, EECSI 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th International Conference on Electrical Engineering, Computer Science and Informatics, EECSI 2024
Y2 - 26 September 2024 through 27 September 2024
ER -