TY - JOUR
T1 - Indonesian News Stance Classification Based on Hybrid Bidirectional LSTM and Transformer Based Embedding
AU - Setiawan, Esther Irawati
AU - Dharmawan, Willyanto
AU - Halim, Kevin Jonathan
AU - Santoso, Joan
AU - Ferdinandus, F. X.
AU - Fujisawa, Kimiya
AU - Purnomo, Mauridhi Hery
N1 - Publisher Copyright:
© (2024), (Intelligent Network and Systems Society). All rights reserved.
PY - 2024
Y1 - 2024
N2 - Stance classification is used to understand the relationship between sentences so that the model can recognize the attitude of a response to a topic, where the attitudes are classified into three, namely supporting (for), neutral (observing), and opposing (against). Furthermore, stance classification could aid the automatic fake news detection. This research is specially made for Indonesian news titles. The proposed model used to recognize these news attitudes is Bidirectional Long Short-Term Memory (Bi-LSTM). Thus, to obtain the word representation vector, the pre-trained Bidirectional Encoder Representations from Transformers (BERT) embedding model from indoBERT is used to process words in Indonesian. In Bi-LSTM, each word representation will be processed twice in a forward and backward direction sequentially, so to get a vector representation of the sentence from the input, the output is taken from the results of the representation process of the last word in the forward direction process and the representation process results of the first word in the backward direction. Then the results of the two outputs are combined to serve as a sentence representation. Based on the test results on the Indonesian news dataset, the model for stance classification task was able to achieve an F1 score with an average of 78.30%, with an F1 score label for (supportive) of 73.10%, label observing (neutral) of 89.57%, and label against (against) by 72.23%. The performance is on par with the results of experiments with several Large Language Models currently available.
AB - Stance classification is used to understand the relationship between sentences so that the model can recognize the attitude of a response to a topic, where the attitudes are classified into three, namely supporting (for), neutral (observing), and opposing (against). Furthermore, stance classification could aid the automatic fake news detection. This research is specially made for Indonesian news titles. The proposed model used to recognize these news attitudes is Bidirectional Long Short-Term Memory (Bi-LSTM). Thus, to obtain the word representation vector, the pre-trained Bidirectional Encoder Representations from Transformers (BERT) embedding model from indoBERT is used to process words in Indonesian. In Bi-LSTM, each word representation will be processed twice in a forward and backward direction sequentially, so to get a vector representation of the sentence from the input, the output is taken from the results of the representation process of the last word in the forward direction process and the representation process results of the first word in the backward direction. Then the results of the two outputs are combined to serve as a sentence representation. Based on the test results on the Indonesian news dataset, the model for stance classification task was able to achieve an F1 score with an average of 78.30%, with an F1 score label for (supportive) of 73.10%, label observing (neutral) of 89.57%, and label against (against) by 72.23%. The performance is on par with the results of experiments with several Large Language Models currently available.
KW - BERT embedding
KW - Bi-LSTM
KW - Indonesian news
KW - Large language models
KW - Stance classification
UR - http://www.scopus.com/inward/record.url?scp=85201509438&partnerID=8YFLogxK
U2 - 10.22266/ijies2024.1031.41
DO - 10.22266/ijies2024.1031.41
M3 - Article
AN - SCOPUS:85201509438
SN - 2185-310X
VL - 17
SP - 517
EP - 537
JO - International Journal of Intelligent Engineering and Systems
JF - International Journal of Intelligent Engineering and Systems
IS - 5
ER -