TY - JOUR
T1 - Deep Semantic Feature Extraction to Overcome Overlapping Frequencies for Instrument Recognition in Indonesian Traditional Music Orchestras
AU - Nurdiyah, Dewi
AU - Yuniarno, Eko Mulyanto
AU - Wulandari, Sari Ayu
AU - Surapto, Yoyon Kusnendar
AU - Purnomo, Mauridhi Hery
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - In Indonesian traditional music, specifically Gamelan, overlapping fundamental frequencies occur among different instruments due to certain tones being tuned in the same octaves. This issue is challenging when the instruments are played simultaneously in the musical orchestras, resulting in mixed frequencies. This study utilizes Gamelan music dataset to address this issue by extracting deep semantic features to capture the distinctive characteristics of each instrument in the orchestras. We propose the fusion of Multi-Task Learning Autoencoder (MTL-AE) with Affine Transformation (AFT) to extract deep semantic features by investigating the optimal input derived from the Log Mel Spectrogram and Mel Frequency Cepstral Coefficient (MFCC). MTL-AE simultaneously extracts deep semantic features from eight instruments in the orchestras. AFT preserves these features according to the instrument class. The optimal extraction method was investigated by comparing the proposed method with baseline methods from MHU-Net and MHU-Net enhanced with Feature-wise Linear Modulation (FiLM). Subsequently, arranging deep semantic features from all instruments aims to obtain the structured feature patterns of eight instrument sources in the orchestras. Machine learning classifiers utilize structured deep semantic features for instrument recognition in the orchestras. Performance comparisons were executed against features derived from vanilla Log Mel Spectrogram, MFCC, Principal Component Analysis (PCA), Modified ResNet-50, MobileNet V3, and YAMNET. The results show that the deep semantic features, extracted using the proposed method with input from MFCC, contribute to the structured deep semantic feature to achieve superior accuracy up to 99%. Hence, these features effectively overcome the issue of overlapping frequencies in musical orchestras.
AB - In Indonesian traditional music, specifically Gamelan, overlapping fundamental frequencies occur among different instruments due to certain tones being tuned in the same octaves. This issue is challenging when the instruments are played simultaneously in the musical orchestras, resulting in mixed frequencies. This study utilizes Gamelan music dataset to address this issue by extracting deep semantic features to capture the distinctive characteristics of each instrument in the orchestras. We propose the fusion of Multi-Task Learning Autoencoder (MTL-AE) with Affine Transformation (AFT) to extract deep semantic features by investigating the optimal input derived from the Log Mel Spectrogram and Mel Frequency Cepstral Coefficient (MFCC). MTL-AE simultaneously extracts deep semantic features from eight instruments in the orchestras. AFT preserves these features according to the instrument class. The optimal extraction method was investigated by comparing the proposed method with baseline methods from MHU-Net and MHU-Net enhanced with Feature-wise Linear Modulation (FiLM). Subsequently, arranging deep semantic features from all instruments aims to obtain the structured feature patterns of eight instrument sources in the orchestras. Machine learning classifiers utilize structured deep semantic features for instrument recognition in the orchestras. Performance comparisons were executed against features derived from vanilla Log Mel Spectrogram, MFCC, Principal Component Analysis (PCA), Modified ResNet-50, MobileNet V3, and YAMNET. The results show that the deep semantic features, extracted using the proposed method with input from MFCC, contribute to the structured deep semantic feature to achieve superior accuracy up to 99%. Hence, these features effectively overcome the issue of overlapping frequencies in musical orchestras.
KW - Affine transformation
KW - Gamelan music dataset
KW - Indonesian traditional music
KW - autoencoder
KW - deep semantic feature extraction
KW - feature embeddings
KW - musical instrument classification
KW - musical instrument recognition
KW - overlapping frequency
UR - http://www.scopus.com/inward/record.url?scp=85193515989&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3401699
DO - 10.1109/ACCESS.2024.3401699
M3 - Article
AN - SCOPUS:85193515989
SN - 2169-3536
VL - 12
SP - 76936
EP - 76954
JO - IEEE Access
JF - IEEE Access
ER -