TY - GEN
T1 - Enhancing YAMNet Model for Lung Sound Classification to Identify Normal and Abnormal Conditions
AU - Arifin, Jaenal
AU - Sardjono, Tri Arief
AU - Kusuma, Hendra
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Healthy lung sounds are produced by airflow during normal breathing. Normal lung sounds lack additional sounds such as rhonchi, wheezing, stridor, or crackles. Abnormal lung sounds result from airflow through an impaired respiratory tract. This research classifies normal and abnormal lung sounds using the YAMNet model, a deep-learning model capable of identifying normal and abnormal lung sounds. The dataset for this research was obtained from Fortis Hospital India and Kaggle. The study involved comprehensive preprocessing of lung sound signals, including sampling at a frequency of 4 kHz, segmenting the lung sound signal for 6 seconds, and smoothing the signal using the Wavelet Smoothing technique, as well as Min-Max Normalization. A 10-fold cross-validation technique was employed, where each iteration used one of the ten parts as the test dataset and the other nine as the training dataset. This model was trained and tested using a 10fold cross-validation technique with an average accuracy of 92.02%. The research yielded accuracy values of 89.81%, precision of 88.53%, recall of 89.74%, and an F1-score of 89.14%.
AB - Healthy lung sounds are produced by airflow during normal breathing. Normal lung sounds lack additional sounds such as rhonchi, wheezing, stridor, or crackles. Abnormal lung sounds result from airflow through an impaired respiratory tract. This research classifies normal and abnormal lung sounds using the YAMNet model, a deep-learning model capable of identifying normal and abnormal lung sounds. The dataset for this research was obtained from Fortis Hospital India and Kaggle. The study involved comprehensive preprocessing of lung sound signals, including sampling at a frequency of 4 kHz, segmenting the lung sound signal for 6 seconds, and smoothing the signal using the Wavelet Smoothing technique, as well as Min-Max Normalization. A 10-fold cross-validation technique was employed, where each iteration used one of the ten parts as the test dataset and the other nine as the training dataset. This model was trained and tested using a 10fold cross-validation technique with an average accuracy of 92.02%. The research yielded accuracy values of 89.81%, precision of 88.53%, recall of 89.74%, and an F1-score of 89.14%.
KW - YAMNET model
KW - abnormal
KW - classification
KW - lung sounds
KW - normal
KW - signal pre-processing
UR - http://www.scopus.com/inward/record.url?scp=85198833829&partnerID=8YFLogxK
U2 - 10.1109/SIML61815.2024.10578277
DO - 10.1109/SIML61815.2024.10578277
M3 - Conference contribution
AN - SCOPUS:85198833829
T3 - 2024 International Conference on Smart Computing, IoT and Machine Learning, SIML 2024
SP - 117
EP - 123
BT - 2024 International Conference on Smart Computing, IoT and Machine Learning, SIML 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Conference on Smart Computing, IoT and Machine Learning, SIML 2024
Y2 - 6 June 2024 through 7 June 2024
ER -