A Combination of Term Frequency and Topic Modeling with Public Attention to Detect Hot Topics on Texts of Indonesian Online News

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

In the age of digitalization, accurately identifying trending topics within online news articles is crucial for understanding public interests and concerns. This study introduces an approach that combines Term Frequency-Inverse Document Frequency (TF-IDF) and Topic Modeling techniques to pinpoint viral keywords in news content. While TF -IDF conventionally assigns word weights based on document frequency, it often struggles to capture emerging trends effectively. To overcome this limitation, our research presents a modified TF -IDF approach that incorporates temporal sensitivity, enabling us to detect trends in a more timely manner. Our methodology takes into account not only word frequency but also factors such as user attention and time in a specific period, on how the keyword appears in every month, thereby improving the accuracy of identifying the burst effect of the keyword. Our results demonstrate that our modified TF-IDF approach surpasses by 0.25 from TF -IDF in identifying viral content through KMeans clustering and exhibits topic modeling accuracy. This is especially evident when analyzing discussions related to events in 2022, as validated using Google Trend viral keywords. Despite our approach's improvements, it remains constrained by the need for periodic data updates and processing, preventing real-time trend detection. In conclusion, our research seeks to enhance the reliability of digital data for informed decision-making, aligning with sustainability goals. By providing a novel approach to identifying viral keywords in online news, we aim to contribute to a better understanding of public interests and concerns in the digital age.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 7th International Conference on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages363-368
Number of pages6
ISBN (Electronic)9798350382266
DOIs
Publication statusPublished - 2023
Event7th International Conference on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2023 - Purwokerto, Indonesia
Duration: 29 Nov 202330 Nov 2023

Publication series

NameProceedings - 2023 IEEE 7th International Conference on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2023

Conference

Conference7th International Conference on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2023
Country/TerritoryIndonesia
CityPurwokerto
Period29/11/2330/11/23

Keywords

  • Attention
  • Hot Topics
  • TF-IDF
  • Time
  • Topic Modeling

Fingerprint

Dive into the research topics of 'A Combination of Term Frequency and Topic Modeling with Public Attention to Detect Hot Topics on Texts of Indonesian Online News'. Together they form a unique fingerprint.

Cite this