TY - GEN
T1 - Software complexity metric-based defect classification using FARM with preprocessing step CFS and SMOTE a preliminary study
AU - Naufal, Mohammad Farid
AU - Rochimah, Siti
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/3/21
Y1 - 2016/3/21
N2 - One criteria for assessing the software quality is ensuring that there is no defect in the software which is being developed. Software defect classification can be used to prevent software defects. More earlier software defects are detected in the software life cycle, it will minimize the software development costs. This study proposes a software defect classification using Fuzzy Association Rule Mining (FARM) based on complexity metrics. However, not all complexity metrics affect on software defect, therefore it requires metrics selection process using Correlation-based Feature Selection (CFS) so it can increase the classification performance. This study will conduct experiments on the NASA MDP open source dataset that is publicly accessible on the PROMISE repository. This datasets contain history log of software defects based on software complexity metric. In NASA MDP dataset the data distribution between defective and not defective modules are not balanced. It is called class imbalanced problem. Class imbalance problem can affect on classification performance. It needs a technique to solve this problem using oversampling method. Synthetic Minority Oversampling Technique (SMOTE) is used in this study as oversampling method. With the advantages possessed by FARM in learning on dataset which has quantitative data attribute and combined with the software complexity metrics selection process using CFS and oversampling using SMOTE, this method is expected has a better performance than the previous methods.
AB - One criteria for assessing the software quality is ensuring that there is no defect in the software which is being developed. Software defect classification can be used to prevent software defects. More earlier software defects are detected in the software life cycle, it will minimize the software development costs. This study proposes a software defect classification using Fuzzy Association Rule Mining (FARM) based on complexity metrics. However, not all complexity metrics affect on software defect, therefore it requires metrics selection process using Correlation-based Feature Selection (CFS) so it can increase the classification performance. This study will conduct experiments on the NASA MDP open source dataset that is publicly accessible on the PROMISE repository. This datasets contain history log of software defects based on software complexity metric. In NASA MDP dataset the data distribution between defective and not defective modules are not balanced. It is called class imbalanced problem. Class imbalance problem can affect on classification performance. It needs a technique to solve this problem using oversampling method. Synthetic Minority Oversampling Technique (SMOTE) is used in this study as oversampling method. With the advantages possessed by FARM in learning on dataset which has quantitative data attribute and combined with the software complexity metrics selection process using CFS and oversampling using SMOTE, this method is expected has a better performance than the previous methods.
KW - Bugs
KW - Correlation-based Feature Selection
KW - Defect
KW - Fault
KW - Fuzzy Association Rule Mining
KW - Machine Learning
KW - Software Defect Classification
KW - Synthetic Minority Oversampling Technique
UR - http://www.scopus.com/inward/record.url?scp=84966728275&partnerID=8YFLogxK
U2 - 10.1109/ICITSI.2015.7437685
DO - 10.1109/ICITSI.2015.7437685
M3 - Conference contribution
AN - SCOPUS:84966728275
T3 - 2015 International Conference on Information Technology Systems and Innovation, ICITSI 2015 - Proceedings
BT - 2015 International Conference on Information Technology Systems and Innovation, ICITSI 2015 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Information Technology Systems and Innovation, ICITSI 2015
Y2 - 16 November 2015 through 19 November 2015
ER -