TY - JOUR
T1 - Cluster analysis-based approach features selection on machine learning for detecting intrusion
AU - Aziz, Mohammad Nasrul
AU - Ahmad, Tohari
N1 - Publisher Copyright:
© 2008 The Intelligent Networks and Systems Society.
PY - 2019
Y1 - 2019
N2 - Various machine learning technology approaches have been applied to intrusion detection system (IDS). To get optimal results, it needs to take several stages for processing the traffics. Among them is the feature selection method, where irrelevant and redundant features are removed. In the previous research, the system is developed based on feature grouping that used a clustering approach as the evaluation criteria. In this research, we propose a method for improving the performance of machine learning with the feature selection approach based on feature clustering. We propose cluster based feature selection derived from the value of mutual information and Pearson correlation. The cluster hierarchy is used in forming filters that are used to create selected and reduced clusters. In developing the cluster hierarchy, single, complete, and average linkage method are used to determine the formation of the best feature clusters. The classification method with Support Vector Machine (SVM), Naïve Bayes, and J48 decision tree are applied to observe the performance of the proposed feature selection. Based on the experimental results, we find that the highest accuracy (i.e., 99.842%) is obtained when a single linkage in the J48 classification is implemented in the Kyoto 2006 dataset.
AB - Various machine learning technology approaches have been applied to intrusion detection system (IDS). To get optimal results, it needs to take several stages for processing the traffics. Among them is the feature selection method, where irrelevant and redundant features are removed. In the previous research, the system is developed based on feature grouping that used a clustering approach as the evaluation criteria. In this research, we propose a method for improving the performance of machine learning with the feature selection approach based on feature clustering. We propose cluster based feature selection derived from the value of mutual information and Pearson correlation. The cluster hierarchy is used in forming filters that are used to create selected and reduced clusters. In developing the cluster hierarchy, single, complete, and average linkage method are used to determine the formation of the best feature clusters. The classification method with Support Vector Machine (SVM), Naïve Bayes, and J48 decision tree are applied to observe the performance of the proposed feature selection. Based on the experimental results, we find that the highest accuracy (i.e., 99.842%) is obtained when a single linkage in the J48 classification is implemented in the Kyoto 2006 dataset.
KW - Classification
KW - Data mining
KW - Intrusion detection
KW - Machine learning
KW - Network security
UR - http://www.scopus.com/inward/record.url?scp=85068557568&partnerID=8YFLogxK
U2 - 10.22266/ijies2019.0831.22
DO - 10.22266/ijies2019.0831.22
M3 - Article
AN - SCOPUS:85068557568
SN - 2185-310X
VL - 12
SP - 233
EP - 243
JO - International Journal of Intelligent Engineering and Systems
JF - International Journal of Intelligent Engineering and Systems
IS - 4
ER -