TY - GEN
T1 - Flight Delay Prediction for Mitigation of Airport Commercial Revenue Losses Using Machine Learning on Imbalanced Dataset
AU - Sugara, Rae Arun
AU - Purwitasari, Diana
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Flight delay is one of the factors that affect the decline in customer satisfaction and airport revenue. In addition to influencing customer perceptions of airport services, flight delay also has an impact on decreasing airport revenue and operation. This study models a flight delay prediction, and the process is carried out using Decision Tree, Random Forest, Gradient Boosted Tree, and XGBoost Tree algorithms. This study has also used and merged the weather characteristic data as secondary data to the airport operational flight data. To anticipate the imbalanced class, several sampling techniques were applied. Synthetic Minority Over Sampling Technique (SMOTE), Random Over-Sampling (ROS), Random Under-Sampling (RUS), and combining ROS with RUS are being used. The result of processing the analysis is in the form of a model to predict the category of flight delay. The model has been evaluated by using the Confusion Matrix and Area Under ROC Curve (AUC) value. The result of this study shows the Random Forest classifier with the combination of ROS + RUS technique and data split ratio of 90:10 gave the highest accuracy, error rate, and AUC value as shown as 82.58%, 17.42%, and 81.1% respectively on data testing. The result of the flight delay prediction model is expected to be a strategic recommendation for determining airport policies in the future. By implementing the best strategy related to the airport operation, it could carry out commercial planning in order to optimize airport commercial revenue.
AB - Flight delay is one of the factors that affect the decline in customer satisfaction and airport revenue. In addition to influencing customer perceptions of airport services, flight delay also has an impact on decreasing airport revenue and operation. This study models a flight delay prediction, and the process is carried out using Decision Tree, Random Forest, Gradient Boosted Tree, and XGBoost Tree algorithms. This study has also used and merged the weather characteristic data as secondary data to the airport operational flight data. To anticipate the imbalanced class, several sampling techniques were applied. Synthetic Minority Over Sampling Technique (SMOTE), Random Over-Sampling (ROS), Random Under-Sampling (RUS), and combining ROS with RUS are being used. The result of processing the analysis is in the form of a model to predict the category of flight delay. The model has been evaluated by using the Confusion Matrix and Area Under ROC Curve (AUC) value. The result of this study shows the Random Forest classifier with the combination of ROS + RUS technique and data split ratio of 90:10 gave the highest accuracy, error rate, and AUC value as shown as 82.58%, 17.42%, and 81.1% respectively on data testing. The result of the flight delay prediction model is expected to be a strategic recommendation for determining airport policies in the future. By implementing the best strategy related to the airport operation, it could carry out commercial planning in order to optimize airport commercial revenue.
KW - Area Under Curve (AUC)
KW - classifier
KW - commercial revenue
KW - confusion matrix
KW - flight delay
KW - sampling technique
UR - http://www.scopus.com/inward/record.url?scp=85149138507&partnerID=8YFLogxK
U2 - 10.1109/CENIM56801.2022.10037369
DO - 10.1109/CENIM56801.2022.10037369
M3 - Conference contribution
AN - SCOPUS:85149138507
T3 - Proceeding of the International Conference on Computer Engineering, Network and Intelligent Multimedia, CENIM 2022
SP - 6
EP - 13
BT - Proceeding of the International Conference on Computer Engineering, Network and Intelligent Multimedia, CENIM 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 International Conference on Computer Engineering, Network and Intelligent Multimedia, CENIM 2022
Y2 - 22 November 2022 through 23 November 2022
ER -