Oversampling Imbalance Data: Case Study on Functional and Non Functional Requirement

Lukman Hakim, Siti Rochimah

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Non-functional requirements are one of the important factors that have a role to determine the success of software development. This factor is usually often ignored by developers because it is difficult to identify. Difficulties in identifying non-functional requirements carry a negative impact affecting software quality. A new framework is needed to identify non-functional requirements. The existing approach has not been able to properly identify non-functional requirements in case studies of data imbalance. Data imbalance may affect the performance of classification methods. Therefore, this study proposes a comparison of three methods namely RF, KNN and SVM using SMOTE. In this research, The total dataset used is 1366 requirements. The dataset includes unbalanced data, which consists of 1141 functional requirements and 225 non-functional requirements. The result of this research prove that data imbalance cause decreases the accuracy. This is evidenced by testing using three methods namely SVM, KNN, and RF. All three methods have the highest accuracy value in minority class 100%. While the best method with the highest accuracy value obtained by SVM because it proved to have better performance than KNN, and RF in terms of data imbalance with 100% minority class.

Original languageEnglish
Title of host publication2018 Electrical Power, Electronics, Communications, Controls and Informatics Seminar, EECCIS 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages315-319
Number of pages5
ISBN (Electronic)9781538652510
DOIs
Publication statusPublished - 2 Jul 2018
Event2018 Electrical Power, Electronics, Communications, Controls and Informatics Seminar, EECCIS 2018 - Batu, East Java, Indonesia
Duration: 9 Oct 201811 Oct 2018

Publication series

Name2018 Electrical Power, Electronics, Communications, Controls and Informatics Seminar, EECCIS 2018

Conference

Conference2018 Electrical Power, Electronics, Communications, Controls and Informatics Seminar, EECCIS 2018
Country/TerritoryIndonesia
CityBatu, East Java
Period9/10/1811/10/18

Keywords

  • Data Imbalance
  • Non-functional
  • Smote

Fingerprint

Dive into the research topics of 'Oversampling Imbalance Data: Case Study on Functional and Non Functional Requirement'. Together they form a unique fingerprint.

Cite this