TY - GEN
T1 - On selecting features for binary classification in microarray data analyses
AU - Purnami, Santi W.
AU - Andari, Shofi
AU - Rusydiana, Afifah W.
N1 - Publisher Copyright:
© 2017 ACM.
PY - 2017/2/24
Y1 - 2017/2/24
N2 - Feature selection has become the most interesting challenge in processing the analysis of high-dimensional microarray data. It addresses the issue of dimensionality reduction by obtaining important features to construct a good model, especially for classification. There are many different feature selection methods that have been proposed, developed, and, eventually, commonly used. Some of these methods are discussed by many researchers to be excellent to improve the accuracy of classification by taking care of redundant and irrelevant instances. The complicated associations among the genes in microarrays data tend to make works more difficult, but removing less important features can improve the accuracy. In this study, basic feature selection techniques were compared based on support vector machine performance in classifying binary classification problems. The experiments were based on high-dimensional microarray datasets which were preprocessed by reducing its dimensionality using correlation based feature selection and fast correlation based filter and evaluated based on classification accuracy resulted from support vector machine standard.
AB - Feature selection has become the most interesting challenge in processing the analysis of high-dimensional microarray data. It addresses the issue of dimensionality reduction by obtaining important features to construct a good model, especially for classification. There are many different feature selection methods that have been proposed, developed, and, eventually, commonly used. Some of these methods are discussed by many researchers to be excellent to improve the accuracy of classification by taking care of redundant and irrelevant instances. The complicated associations among the genes in microarrays data tend to make works more difficult, but removing less important features can improve the accuracy. In this study, basic feature selection techniques were compared based on support vector machine performance in classifying binary classification problems. The experiments were based on high-dimensional microarray datasets which were preprocessed by reducing its dimensionality using correlation based feature selection and fast correlation based filter and evaluated based on classification accuracy resulted from support vector machine standard.
KW - Feature selection
KW - Microarray analyses
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=85024397837&partnerID=8YFLogxK
U2 - 10.1145/3055635.3056644
DO - 10.1145/3055635.3056644
M3 - Conference contribution
AN - SCOPUS:85024397837
T3 - ACM International Conference Proceeding Series
SP - 133
EP - 136
BT - Proceedings of 2017 9th International Conference on Machine Learning and Computing, ICMLC 2017
PB - Association for Computing Machinery
T2 - 9th International Conference on Machine Learning and Computing, ICMLC 2017
Y2 - 24 February 2017 through 26 February 2017
ER -