Assessing the Effectiveness of Oversampling and Undersampling Techniques for Intrusion Detection on an Imbalanced Dataset

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

The imbalanced class distribution in intrusion detection systems has been a significant issue. Imbalanced class distribution can negatively impact the performance of intrusion detection systems as they may be biased towards the majority class. We explore the effectiveness of oversampling and under-sampling techniques to address this issue. Oversampling and undersampling techniques aim to balance the class distribution and improve the performance of the intrusion detection system. Oversampling increases the number of records in the minority class to make it closer in size to the majority class. Conversely, undersampling reduces the number of records in the majority class so that it is closer in size to the minority class. We assess the effectiveness of different oversampling and undersampling techniques, including Random OverSampling, SMOTE, ADASYN, Random UnderSampling, AllKNN, TomekLinks, SMOTEENN, and SMOTETomek. The experiment's findings indicate that the raw data achieved the highest accuracy score, 0.965. On the other hand, the Random Oversampling method yielded the highest F1 score, reaching a score of 0.589. When we see the evaluation scores of each class, the recall & F1 scores generally show high contrast between classes with a large amount of data and classes with (previously) a small amount of data, even though the data for training has been more balanced. We found that oversampling and undersampling can improve the performance of intrusion detection systems in specific ways, but this still needs improvement. These results can serve as a reference for researchers developing intrusion detection systems.

Original languageEnglish
Title of host publicationIEACon 2023 - 2023 IEEE Industrial Electronics and Applications Conference
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages92-97
Number of pages6
ISBN (Electronic)9798350347517
DOIs
Publication statusPublished - 2023
Event4th IEEE Industrial Electronics and Applications Conference, IEACon 2023 - Penang, Malaysia
Duration: 6 Nov 20237 Nov 2023

Publication series

NameIEACon 2023 - 2023 IEEE Industrial Electronics and Applications Conference

Conference

Conference4th IEEE Industrial Electronics and Applications Conference, IEACon 2023
Country/TerritoryMalaysia
CityPenang
Period6/11/237/11/23

Keywords

  • imbalanced class
  • intrusion detection system
  • oversampling and undersampling

Fingerprint

Dive into the research topics of 'Assessing the Effectiveness of Oversampling and Undersampling Techniques for Intrusion Detection on an Imbalanced Dataset'. Together they form a unique fingerprint.

Cite this