Skip to main navigation Skip to search Skip to main content

Text Augmentation to Overcome Data Limitations in Sentiment Analysis for Bahasa Indonesia

  • Institut Teknologi Sepuluh Nopember

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper focuses on the challenges of sentiment analysis in Natural Language Processing (NLP), with a special emphasis on limited data scenarios. Sentiment analysis, a critical component of text understanding, faces specific difficulties when available data are scarce, leading to problems such as overfitting or underfitting. This study explores ensemble learning techniques, POS Tagging, and text augmentation techniques to overcome these data limitations. The experimental results demonstrate that text augmentation with the prefixes "me-", "ter-", "ber-", and "di-" is effective relative to increasing data variety and quantity, which contributes to improved sentiment analysis model performance. The ensemble learning model achieved an accuracy of 91.29% with significant improvements in precision, recall, and F1-score.

Original languageEnglish
Title of host publicationProceedings of 2024 IEEE International Conference on Data and Software Engineering
Subtitle of host publicationData-Driven Innovation: Transforming Industries and Societies, ICoDSE 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages217-222
Number of pages6
ISBN (Electronic)9798331506407
DOIs
Publication statusPublished - 2024
Event2024 IEEE International Conference on Data and Software Engineering, ICoDSE 2024 - Hybrid, Gorontalo, Indonesia
Duration: 30 Oct 202431 Oct 2024

Publication series

NameProceedings of 2024 IEEE International Conference on Data and Software Engineering: Data-Driven Innovation: Transforming Industries and Societies, ICoDSE 2024

Conference

Conference2024 IEEE International Conference on Data and Software Engineering, ICoDSE 2024
Country/TerritoryIndonesia
CityHybrid, Gorontalo
Period30/10/2431/10/24

Keywords

  • Data Limitations
  • Ensemble Learning
  • Natural Language Processing
  • Sentiment Analysis
  • Text Augmentation

Fingerprint

Dive into the research topics of 'Text Augmentation to Overcome Data Limitations in Sentiment Analysis for Bahasa Indonesia'. Together they form a unique fingerprint.

Cite this