TY - GEN
T1 - Predictive modeling of the first year evaluation based on demographics data
T2 - 3rd International Conference on Data and Software Engineering, ICoDSE 2016
AU - Fahrudin, Tora
AU - Buliali, Joko Lianto
AU - Fatichah, Chastine
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/5/30
Y1 - 2017/5/30
N2 - Student academic failure prediction is still interesting topics in the Educational Data Mining. One of the challenges is how to predict student academic failure as early as possible. This research focuses on predictive modeling of unsuccessful students in the first year evaluation. We propose a new concept of predictive modeling of the first year evaluation which combines 3 input data: demographics, academic and social media. The modeling can be divided into two sub modeling (normal period and extra period). In this paper, we focus on demographic data modeling (first sub-modeling) which correlated with the probability of a student to pass the first year evaluation on normal period. A Weka tool is used to get a pattern of data by using white box classifier (decision tree and rule base). Meanwhile, to solve the problem of unbalanced in our training data, we use data balancing scenario using same portion oversampling, random oversampling and SMOTE. From the testing result, we choose the best three student failure pattern of the F-Measure minor class value which obtained from 'One R' and 'ADTree' algorithms using Balancing scenario, the reason is because F-Measure describes the smallest error rate both FP (False Positive) and also FN (False Negative). From the best three of student failure pattern, we found that gender, selection path, study program and age are the attributes that are most correlated with the probability to pass the first year evaluation on extra period.
AB - Student academic failure prediction is still interesting topics in the Educational Data Mining. One of the challenges is how to predict student academic failure as early as possible. This research focuses on predictive modeling of unsuccessful students in the first year evaluation. We propose a new concept of predictive modeling of the first year evaluation which combines 3 input data: demographics, academic and social media. The modeling can be divided into two sub modeling (normal period and extra period). In this paper, we focus on demographic data modeling (first sub-modeling) which correlated with the probability of a student to pass the first year evaluation on normal period. A Weka tool is used to get a pattern of data by using white box classifier (decision tree and rule base). Meanwhile, to solve the problem of unbalanced in our training data, we use data balancing scenario using same portion oversampling, random oversampling and SMOTE. From the testing result, we choose the best three student failure pattern of the F-Measure minor class value which obtained from 'One R' and 'ADTree' algorithms using Balancing scenario, the reason is because F-Measure describes the smallest error rate both FP (False Positive) and also FN (False Negative). From the best three of student failure pattern, we found that gender, selection path, study program and age are the attributes that are most correlated with the probability to pass the first year evaluation on extra period.
KW - Data balancing
KW - Decision tree
KW - Educational data mining
KW - First year evaluation
KW - Rule base
KW - SMOTE
KW - Weka
UR - http://www.scopus.com/inward/record.url?scp=85025602674&partnerID=8YFLogxK
U2 - 10.1109/ICODSE.2016.7936158
DO - 10.1109/ICODSE.2016.7936158
M3 - Conference contribution
AN - SCOPUS:85025602674
T3 - Proceedings of 2016 International Conference on Data and Software Engineering, ICoDSE 2016
BT - Proceedings of 2016 International Conference on Data and Software Engineering, ICoDSE 2016
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 26 October 2016 through 27 October 2016
ER -