Improved performance of fake account classifiers with percentage overlap features selection

Aris Tjahyanto*, Rivanda Putra Pratama, Ary Mazharuddin Shiddiqi

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Feature selection plays a crucial role in the development of high-performance classification models. We propose an innovative method for detecting fake accounts. This method leverages the percentage overlap technique to refine feature selection. We introduce our technique upon earlier work that showcased the enhanced efficacy of the Naïve Bayesian classifier through dataset normalization. Our study employs a dataset of account profiles sourced from Twitter, which we normalize using the Min-Max method. We analyze the results through a series of comprehensive experiments involving diverse classification algorithms—such as Naïve Bayes, decision tree, k-nearest neighbors (KNN), deep learning, and support vector machines (SVM). Our experimental results demonstrate a 100% accuracy achieved by the SVM and deep learning classifiers. The results are attributed to the percentage overlap technique, which facilitates the identification of four highly informative features. These findings outperform models with more extensive feature sets, underscoring the efficacy of our approach.

Original languageEnglish
Pages (from-to)1585-1595
Number of pages11
JournalIAES International Journal of Artificial Intelligence
Volume13
Issue number2
DOIs
Publication statusPublished - Jun 2024

Keywords

  • Bots
  • Fake accounts
  • Fake users
  • Feature selection
  • Internet

Fingerprint

Dive into the research topics of 'Improved performance of fake account classifiers with percentage overlap features selection'. Together they form a unique fingerprint.

Cite this