Mobile app review labeling using lda similarity and term frequency-inverse cluster frequency (TF-ICF)

Alifia Puspaningrum, Daniel Siahaan, Chastine Fatichah

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Citations (Scopus)

Abstract

User review mining has attracted many researchers to analyze and develop innovative models. The models provide technical recommendation for software developers to make decisions during software maintenance a software evolution. One of the recommendations is user review categorization. There are many categorizations have been popularly used, namely bug errors, feature requests, and noninformative. There are many methods that have been done to classify user reviews. One of the classification methods is Latent Dirichlet Allocation (LDA). LDA is a topic modelling method which ables to map hidden topics resided in a document. Thus, one of techniques to map hidden topics into categories is calculating term similarity value between hidden topic and the pre-defined signifier term list. However, the limited signifier term list of each category becomes a problem. Meanwhile Term Frequency-Inverse Corpus Frequency (TF-ICF) is able to take important terms on a cluster. Therefore, this paper introduces a method that combines TF-ICF with LDA clustering based on similarity (LDAS TF-ICF) to overcome it. The classification results were calculated by using precision, recall, and F1-score. The results show the method can outperform LDA. The best performance of LDAS TF-ICF occured when 75% expanded term list was used, given the precision, recall, dan f-measure values 0.564, 0.507, and 0.491, respectively.

Original languageEnglish
Title of host publicationProceedings of 2018 10th International Conference on Information Technology and Electrical Engineering
Subtitle of host publicationSmart Technology for Better Society, ICITEE 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages365-370
Number of pages6
ISBN (Electronic)9781538647394
DOIs
Publication statusPublished - 13 Nov 2018
Event10th International Conference on Information Technology and Electrical Engineering, ICITEE 2018 - Bali, Indonesia
Duration: 24 Jul 201826 Jul 2018

Publication series

NameProceedings of 2018 10th International Conference on Information Technology and Electrical Engineering: Smart Technology for Better Society, ICITEE 2018

Conference

Conference10th International Conference on Information Technology and Electrical Engineering, ICITEE 2018
Country/TerritoryIndonesia
CityBali
Period24/07/1826/07/18

Keywords

  • LDA
  • Review Semantic Similarity
  • Software Evolution
  • Software Maintenance
  • TF-ICF

Fingerprint

Dive into the research topics of 'Mobile app review labeling using lda similarity and term frequency-inverse cluster frequency (TF-ICF)'. Together they form a unique fingerprint.

Cite this