Machine Learning-Based Intrusion Detection on Multi-Class Imbalanced Dataset Using SMOTE

Akdeas Oktanae Widodo, Bambang Setiawan, Rarasmaya Indraswari*

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

3 Citations (Scopus)

Abstract

The rapid development of information technology has brought numerous benefits to society, but it has also led to increased security vulnerabilities in network systems. Intrusion detection systems (IDS) play a crucial role in identifying malicious activities, but they face challenges due to imbalanced datasets where the number of attack samples outweighs normal activities. This paper explores the performance of an IDS using SMOTE (Synthetic Minority Over-sampling Technique) and various classification algorithms to address imbalanced datasets and enhance detection of multi-class intrusions. Related works in the field of intrusion detection are reviewed, highlighting the effectiveness of different algorithms and techniques. The proposed work presents a model that combines SMOTE with log normalization and feature selection to improve IDS performance. Experiments are conducted on the NSL-KDD and CIC-IDS2017 datasets, evaluating different oversampling configurations and machine learning models. The results show that applying SMOTE improves overall performance, with high accuracy, precision, recall, and F1-score. Feature selection has minimal impact on model performance, suggesting the presence of redundant features. The study concludes that SMOTE effectively addresses class imbalance and enhances IDS performance, emphasizing the importance of incorporating oversampling techniques in intrusion detection systems.

Original languageEnglish
Pages (from-to)578-583
Number of pages6
JournalProcedia Computer Science
Volume234
DOIs
Publication statusPublished - 2024
Event7th Information Systems International Conference, ISICO 2023 - Washington, United States
Duration: 26 Jul 202328 Jul 2023

Keywords

  • Intrusion detection
  • SMOTE
  • imbalanced dataset
  • machine learning

Fingerprint

Dive into the research topics of 'Machine Learning-Based Intrusion Detection on Multi-Class Imbalanced Dataset Using SMOTE'. Together they form a unique fingerprint.

Cite this