Synonym based feature expansion for Indonesian hate speech detection

Imam Ghozali, Kelly Rossa Sungkono, Riyanarto Sarno*, Rachmad Abdullah

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)

Abstract

Online hate speech is one of the negative impacts of internet-based social media development. Hate speech occurs due to a lack of public understanding of criticism and hate speech. The Indonesian government has regulations regarding hate speech, and most of the existing research about hate speech only focuses on feature extraction and classification methods. Therefore, this paper proposes methods to identify hate speech before a crime occurs. This paper presents an approach to detect hate speech by expanding synonyms in word embedding and shows the classification comparison result between Word2Vec and FastText with bidirectional long short-term memory which are processed using synonym expanding process and without it. The goal is to classify hate speech and non-hate speech. The best accuracy result without the synonym expanding process is 0.90, and the expanding synonym process is 0.93.

Original languageEnglish
Pages (from-to)1105-1112
Number of pages8
JournalInternational Journal of Electrical and Computer Engineering
Volume13
Issue number1
DOIs
Publication statusPublished - Feb 2023

Keywords

  • Bidirectional long short-term memory
  • FastText
  • Hate speech
  • Synonym
  • Word2Vec

Fingerprint

Dive into the research topics of 'Synonym based feature expansion for Indonesian hate speech detection'. Together they form a unique fingerprint.

Cite this