A Deep Learning Approach for Word Segmentation in Javanese Letter Manuscript Transliteration

  • Muhammad Nevin
  • , I. Kadek Agus Ariesta Putra
  • , Dwinanda Bagoes Ansori
  • , Riyanarto Sarno
  • , Agus Haryono

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Traditionally written using the Javanese Letter or 'Aksara Jawa' script, the Javanese language encompasses a rich corpus of manuscripts that record diverse subjects such as history, culture, and traditional practices. With over 121,668 Javanese manuscript titles identified, only a fraction has been transliterated into alphabetical writing, underscoring the urgent need for efficient language preservation methods. This study evaluates deep learning models for word segmentation in Javanese manuscript transliteration. Results from experiments conducted on an unseen dataset reveal that the Bidirectional Long Short-Term Memory (BiLSTM) model outperforms other architectures consistently across all metrics. With an accuracy of 98.62% and superior scores in f1-score, precision, and recall, the BiLSTM model demonstrates robustness in capturing the intricate linguistic patterns and textual structures inherent in Javanese manuscripts.

Original languageEnglish
Title of host publicationProceedings - 2024 2nd International Conference on Technology Innovation and Its Applications, ICTIIA 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350351613
DOIs
Publication statusPublished - 2024
Event2nd International Conference on Technology Innovation and Its Applications, ICTIIA 2024 - Medan, Indonesia
Duration: 12 Sept 202413 Sept 2024

Publication series

NameProceedings - 2024 2nd International Conference on Technology Innovation and Its Applications, ICTIIA 2024

Conference

Conference2nd International Conference on Technology Innovation and Its Applications, ICTIIA 2024
Country/TerritoryIndonesia
CityMedan
Period12/09/2413/09/24

Keywords

  • BiLSTM
  • deep learning
  • javanese manuscript
  • natural language processing
  • word segmentation

Fingerprint

Dive into the research topics of 'A Deep Learning Approach for Word Segmentation in Javanese Letter Manuscript Transliteration'. Together they form a unique fingerprint.

Cite this