TY - JOUR
T1 - Ontology-Based Traffic Accident Information Extraction on Twitter In Indonesia
AU - Rakhmawati, Nur Aini
AU - Awwab, Yasin
AU - Najib, Ahmad Choirun
AU - Irsyad, Ahmad
N1 - Publisher Copyright:
© IBERAMIA and the authors.
PY - 2022/12
Y1 - 2022/12
N2 - Traffic accidents become one of the events that often occur in Indonesia. From the three-monthly report by the Indonesian National Police Traffic Police, there are about 25,000 traffic accidents. Many social media users, especially Twitter, share information about traffic accidents. Twitter has various information regarding traffic accidents. Therefore, this study aims to process and map information about traffic accidents contained on Twitter in Indonesia language. We use the domain ontology and Named-Entity Recognition for the data extraction process. Named-Entity Recognition is used for obtaining keywords from a tweet based on class categories such as actor, time, location, and information on the cause of the accident. This research generates a Named Entity Recognition (NER) model that can provide a reasonably accurate level of accuracy. Also, we create an ontology that can categorize the causes of traffic accidents based on the Directorate General of the Land Transportation Office, Indonesia. We found that the traffic accidents are generally caused by inadequate vehicle conditions with the main problem in the vehicle caused by brake failure, while environmental factors rarely cause traffic accidents. Moreover, the vehicle is the subclass that mostly appears in the tweets, where car is the most popular actor, followed by truck and motorcycle.
AB - Traffic accidents become one of the events that often occur in Indonesia. From the three-monthly report by the Indonesian National Police Traffic Police, there are about 25,000 traffic accidents. Many social media users, especially Twitter, share information about traffic accidents. Twitter has various information regarding traffic accidents. Therefore, this study aims to process and map information about traffic accidents contained on Twitter in Indonesia language. We use the domain ontology and Named-Entity Recognition for the data extraction process. Named-Entity Recognition is used for obtaining keywords from a tweet based on class categories such as actor, time, location, and information on the cause of the accident. This research generates a Named Entity Recognition (NER) model that can provide a reasonably accurate level of accuracy. Also, we create an ontology that can categorize the causes of traffic accidents based on the Directorate General of the Land Transportation Office, Indonesia. We found that the traffic accidents are generally caused by inadequate vehicle conditions with the main problem in the vehicle caused by brake failure, while environmental factors rarely cause traffic accidents. Moreover, the vehicle is the subclass that mostly appears in the tweets, where car is the most popular actor, followed by truck and motorcycle.
KW - Twitter
KW - information extraction
KW - named entity recognition
KW - ontology
KW - traffic accident
UR - http://www.scopus.com/inward/record.url?scp=85138965546&partnerID=8YFLogxK
U2 - 10.4114/intartif.vol25iss70pp1-12
DO - 10.4114/intartif.vol25iss70pp1-12
M3 - Article
AN - SCOPUS:85138965546
SN - 1137-3601
VL - 25
SP - 1
EP - 12
JO - Inteligencia Artificial
JF - Inteligencia Artificial
IS - 70
ER -