TY - JOUR
T1 - New filtering scheme based on term weighting to improve object based opinion mining on tourism product reviews
AU - Afrizal, Ahimsa Denhas
AU - Rakhmawati, Nur Aini
AU - Tjahyanto, Aris
N1 - Publisher Copyright:
© 2019 The Authors.
PY - 2019
Y1 - 2019
N2 - Reviews are an essential thing in tourism industry. Opinion mining used for processing a massive amount of review data, so it can be more useful for the industry. The utilization of filtering can improve the feature extraction result from object based on opinion mining and can improve opinion classification result generally. However, there is no proven method yet to develop filter data automatically. This work applies several term weighting methods such as TF-IDF mTFIDF and BM25 to develop filter data automatically. The result from this research is the best term weighting method for developing filter data, that can improve the feature extraction and opinion mining relatively. TFIDF become the best term weighting method applied for filter data combined with the most frequent objects, The accuracy is 37.98%, the precision is 50.69%, the recall is 44,28%, and F-measure 47.27% for hotel data. Meanwhile, for restaurant data, the accuracy is 37.98%, precision is 50.69%, recall is 44.28%, and F-measure 47.27%.
AB - Reviews are an essential thing in tourism industry. Opinion mining used for processing a massive amount of review data, so it can be more useful for the industry. The utilization of filtering can improve the feature extraction result from object based on opinion mining and can improve opinion classification result generally. However, there is no proven method yet to develop filter data automatically. This work applies several term weighting methods such as TF-IDF mTFIDF and BM25 to develop filter data automatically. The result from this research is the best term weighting method for developing filter data, that can improve the feature extraction and opinion mining relatively. TFIDF become the best term weighting method applied for filter data combined with the most frequent objects, The accuracy is 37.98%, the precision is 50.69%, the recall is 44,28%, and F-measure 47.27% for hotel data. Meanwhile, for restaurant data, the accuracy is 37.98%, precision is 50.69%, recall is 44.28%, and F-measure 47.27%.
KW - Feature extraction
KW - Filtering
KW - Opinion mining
KW - Term-weighting
UR - http://www.scopus.com/inward/record.url?scp=85078950566&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2019.11.186
DO - 10.1016/j.procs.2019.11.186
M3 - Conference article
AN - SCOPUS:85078950566
SN - 1877-0509
VL - 161
SP - 805
EP - 812
JO - Procedia Computer Science
JF - Procedia Computer Science
T2 - 5th Information Systems International Conference, ISICO 2019
Y2 - 23 July 2019 through 24 July 2019
ER -