Comparative Study of Machine Learning Algorithm on Linguistic Distinctions over Text Related to Human Trafficking and Sexual Exploitation

Danica Aurelie Hartawan*, Bagus Jati Santoso, Baskoro Adi Pratomo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Human trafficking and sexual exploitation (HTSE) are grave global issues that demand comprehensive understanding and effective countermeasures. In total, this issue - also known as modern slavery - affected an estimated 49.6 million people in 2021 and earns traffickers at least $150 billion annually, making it one of the world's most profitable crimes. This crime can penetrate our daily life through internet advertisement, trying to trap non-tech savvy people. Automated analysis for such adverts is essential. It can be used for identifying linguistic patterns in human trafficking and sexual exploitation-related adverstisements. This research employs various machine learning algorithms (i.e., Naïve Bayes, Random Forest, Decision Trees) to classify benign and HTSE advertisements. From our experiments, Random Forest performed best, achieving a high F1 score (0.962), balancing Precision and Recall effectively. Naïve Bayes also showed promising results, while Gradient Boosting had variable F1 scores, and the Tree algorithm scored the lowest. This analysis provides insights into algorithm capabilities and limitations in addressing linguistic distinctions related to human trafficking and sexual exploitation, contributing to better detection systems. By applying these algorithms to a diverse dataset, we aim to enhance our understanding of linguistic cues in addressing these societal challenges and consider potential solutions, policy implications, and future research.

Original languageEnglish
Title of host publication2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages442-447
Number of pages6
ISBN (Electronic)9798350309225
DOIs
Publication statusPublished - 2023
Event2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023 - Lombok, Indonesia
Duration: 14 Nov 202315 Nov 2023

Publication series

Name2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023 - Proceedings

Conference

Conference2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023
Country/TerritoryIndonesia
CityLombok
Period14/11/2315/11/23

Keywords

  • Human Trafficking
  • Linguistic Distinctions
  • Naïve Bayes
  • Random Forest
  • Sexual Exploitation
  • Text Analysis
  • Trees

Fingerprint

Dive into the research topics of 'Comparative Study of Machine Learning Algorithm on Linguistic Distinctions over Text Related to Human Trafficking and Sexual Exploitation'. Together they form a unique fingerprint.

Cite this