TY - JOUR
T1 - Microarray classification using genetic algorithm and latin hypercube sampling
AU - Awangditama, Bangun Rizki
AU - Suciati, Nanik
N1 - Publisher Copyright:
© 2024 Institute of Advanced Engineering and Science. All rights reserved.
PY - 2024/9
Y1 - 2024/9
N2 - Cancer, the second leading cause of global death, requires advanced diagnostic technology. Microarray gene expression technology plays an important role in comprehensively analyzing the genetic aspects of cancer. However, challenges such as high-dimensional attributes, limited samples, and varying gene presence rates hinder the accurate classification of microarray data. This study proposes a model that uses latin hypercube sampling (LHS) in genetic algorithms (GA) for Feature Selection in microarray data classification. LHS makes the chromosome samples in the initial population of GAs representative and diverse. The study used three microarray datasets with different numbers of features and classes. The results reveal that first, the use of GA alone tends to limit the exploration of the resulting feature space, while the use of LHS can expand the feature selection possibilities in the context of feature selection. Secondly, this study shows that microarray classification using GA with LHS (GALHS) consistently outperforms other feature selection methods such as based correlation features (BCF), principal component analysis (PCA), relief, and lasso. Thus, this research contributes to feature selection by applying LHS and GA to optimize the performance of microarray data classification models.
AB - Cancer, the second leading cause of global death, requires advanced diagnostic technology. Microarray gene expression technology plays an important role in comprehensively analyzing the genetic aspects of cancer. However, challenges such as high-dimensional attributes, limited samples, and varying gene presence rates hinder the accurate classification of microarray data. This study proposes a model that uses latin hypercube sampling (LHS) in genetic algorithms (GA) for Feature Selection in microarray data classification. LHS makes the chromosome samples in the initial population of GAs representative and diverse. The study used three microarray datasets with different numbers of features and classes. The results reveal that first, the use of GA alone tends to limit the exploration of the resulting feature space, while the use of LHS can expand the feature selection possibilities in the context of feature selection. Secondly, this study shows that microarray classification using GA with LHS (GALHS) consistently outperforms other feature selection methods such as based correlation features (BCF), principal component analysis (PCA), relief, and lasso. Thus, this research contributes to feature selection by applying LHS and GA to optimize the performance of microarray data classification models.
KW - Classification
KW - Feature selection
KW - Genetic algorithm
KW - Latin hypercube sampling
KW - Microarray
UR - http://www.scopus.com/inward/record.url?scp=85198143649&partnerID=8YFLogxK
U2 - 10.11591/ijeecs.v35.i3.pp1976-1985
DO - 10.11591/ijeecs.v35.i3.pp1976-1985
M3 - Article
AN - SCOPUS:85198143649
SN - 2502-4752
VL - 35
SP - 1976
EP - 1985
JO - Indonesian Journal of Electrical Engineering and Computer Science
JF - Indonesian Journal of Electrical Engineering and Computer Science
IS - 3
ER -