Classification model based on url and content feature approach for detection phishing website in Indonesia

Febry Eka Purwiantono, Aris Tjahyanto

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

This research proposed a classification model that can be used to detect phishing website accurately. This study takes a case study from Indonesia because data used are sites using Bahasa Indonesia, hosted in Indonesia and frequently accessed by Internet users from Indonesia. Dataset used in this research consisted of approximately 102 authentic websites and 364 phishing websites. The proposed detection technique based on website analysis using the URL and content feature based approach. This classification model combines several heterogeneous features from previous research and proposes new URL and content feature based approach that are expected to improve detection performance when compared with previous research. Moreover, in the proposed classification model created a web crawler to extract feature vectors in this research. This research uses four different algorithms such as Sequential Minimal Optimization (SMO), Naive Bayes, Bagging and Multilayer Perceptron. The result, SMO, Naive Bayes, Bagging and Multilayer Perceptron have accuracy of approximately 89.27%, 93.78%, 95.49% and 92.70%. Algorithm has the best accuracy is Bagging, it will be used in this classification model to compare with classification model in previous research using same dataset. The result, accuracy of classification model in this research outperformed accuracy of classification model in previous research. The classification model in this research outperform 5.79% against classification model in previous research which only yielded 89.70% accuracy.

Original languageEnglish
Pages (from-to)4181-4191
Number of pages11
JournalJournal of Theoretical and Applied Information Technology
Volume95
Issue number17
Publication statusPublished - 15 Sept 2017

Keywords

  • Classification model
  • Detection
  • Feature
  • Indonesia
  • Phishing website

Fingerprint

Dive into the research topics of 'Classification model based on url and content feature approach for detection phishing website in Indonesia'. Together they form a unique fingerprint.

Cite this