TY - GEN
T1 - Comparative Study of Machine Learning Algorithm on Linguistic Distinctions over Text Related to Human Trafficking and Sexual Exploitation
AU - Hartawan, Danica Aurelie
AU - Santoso, Bagus Jati
AU - Pratomo, Baskoro Adi
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Human trafficking and sexual exploitation (HTSE) are grave global issues that demand comprehensive understanding and effective countermeasures. In total, this issue - also known as modern slavery - affected an estimated 49.6 million people in 2021 and earns traffickers at least $150 billion annually, making it one of the world's most profitable crimes. This crime can penetrate our daily life through internet advertisement, trying to trap non-tech savvy people. Automated analysis for such adverts is essential. It can be used for identifying linguistic patterns in human trafficking and sexual exploitation-related adverstisements. This research employs various machine learning algorithms (i.e., Naïve Bayes, Random Forest, Decision Trees) to classify benign and HTSE advertisements. From our experiments, Random Forest performed best, achieving a high F1 score (0.962), balancing Precision and Recall effectively. Naïve Bayes also showed promising results, while Gradient Boosting had variable F1 scores, and the Tree algorithm scored the lowest. This analysis provides insights into algorithm capabilities and limitations in addressing linguistic distinctions related to human trafficking and sexual exploitation, contributing to better detection systems. By applying these algorithms to a diverse dataset, we aim to enhance our understanding of linguistic cues in addressing these societal challenges and consider potential solutions, policy implications, and future research.
AB - Human trafficking and sexual exploitation (HTSE) are grave global issues that demand comprehensive understanding and effective countermeasures. In total, this issue - also known as modern slavery - affected an estimated 49.6 million people in 2021 and earns traffickers at least $150 billion annually, making it one of the world's most profitable crimes. This crime can penetrate our daily life through internet advertisement, trying to trap non-tech savvy people. Automated analysis for such adverts is essential. It can be used for identifying linguistic patterns in human trafficking and sexual exploitation-related adverstisements. This research employs various machine learning algorithms (i.e., Naïve Bayes, Random Forest, Decision Trees) to classify benign and HTSE advertisements. From our experiments, Random Forest performed best, achieving a high F1 score (0.962), balancing Precision and Recall effectively. Naïve Bayes also showed promising results, while Gradient Boosting had variable F1 scores, and the Tree algorithm scored the lowest. This analysis provides insights into algorithm capabilities and limitations in addressing linguistic distinctions related to human trafficking and sexual exploitation, contributing to better detection systems. By applying these algorithms to a diverse dataset, we aim to enhance our understanding of linguistic cues in addressing these societal challenges and consider potential solutions, policy implications, and future research.
KW - Human Trafficking
KW - Linguistic Distinctions
KW - Naïve Bayes
KW - Random Forest
KW - Sexual Exploitation
KW - Text Analysis
KW - Trees
UR - http://www.scopus.com/inward/record.url?scp=85186518628&partnerID=8YFLogxK
U2 - 10.1109/ICAMIMIA60881.2023.10427655
DO - 10.1109/ICAMIMIA60881.2023.10427655
M3 - Conference contribution
AN - SCOPUS:85186518628
T3 - 2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023 - Proceedings
SP - 442
EP - 447
BT - 2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023
Y2 - 14 November 2023 through 15 November 2023
ER -