Noun phrases extraction using shallow parsing with C4.5 decision tree algorithm for Indonesian Language ontology building

Joan Santoso, G. Gunawan, Hermes Vincentius Gani, Eko Mulyanto Yuniarno, Mochamad Hariadi, Mauridhi Hery Purnomo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Ontology describes a set of concept or entity and each relation. Ontology as knowledge representation usually has a large structure because it can cover a wide area topics. Ontology building process is divided into two subprocesses, those are term extraction and relation formation. Term extraction in ontology building is done for extracting concept or entity before each relation is obtained. Main objective in this research is to extract noun phrases using shallow parsing algorithm based on C4.5 decision tree as candidate concept or term for ontology building process in Indonesian Text. One of the advantages of using shallow parsing is it can recover syntactic information efficiently and reliably from unrestricted text. For our dataset, we use Indonesian Language online newspapers for one month. Based on our experiments, it concludes that our proposed method can perform well for Indonesian Language noun phrase identification with average F-score 84.63%.

Original languageEnglish
Title of host publication2015 15th International Symposium on Communications and Information Technologies, ISCIT 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages149-152
Number of pages4
ISBN (Electronic)9781467368209
DOIs
Publication statusPublished - 22 Apr 2016
Event15th International Symposium on Communications and Information Technologies, ISCIT 2015 - Nara, Japan
Duration: 7 Oct 20159 Oct 2015

Publication series

Name2015 15th International Symposium on Communications and Information Technologies, ISCIT 2015

Conference

Conference15th International Symposium on Communications and Information Technologies, ISCIT 2015
Country/TerritoryJapan
CityNara
Period7/10/159/10/15

Keywords

  • Data Mining
  • Decision Tree
  • Indonesian Language
  • Noun Phrase
  • Shallow Parsing
  • Term Extraction

Fingerprint

Dive into the research topics of 'Noun phrases extraction using shallow parsing with C4.5 decision tree algorithm for Indonesian Language ontology building'. Together they form a unique fingerprint.

Cite this