TY - GEN
T1 - Optimizing Process Discovery Quality Criteria and Model Measurements using Receiver Operating Characteristic Analysis and Infrequent Inductive Miner
AU - Budiraharjo, R.
AU - Prasetyo, Hanung Nindito
AU - Sarno, Riyanarto
AU - Sungkono, Kelly Rossa
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/4/8
Y1 - 2021/4/8
N2 - Generating process models that reflect close behavioral resemblance to the actual process Standard Operating Procedure (SOP) in process mining can be challenging without taking the four quality criteria of process discovery into account. The four quality criteria, i.e. fitness, precision, generalization, and simplicity, should be well balanced in order to produce proper process models which are aligned to the real-life executions. This paper proposes a method to optimize process discovery quality criteria (PDQC) by implementing different thresholds and analyzing calculation results using Receiver Operating Characteristic (ROC) curve and Infrequent Inductive Miner algorithm. This paper sets up two experiments with different scenarios to measure the calculations of quality criteria and the quality of generated models. The experiments compare two SOPs to the process models discovered by Infrequent Inductive Miner algorithm; hence the SOPs serve as references to determine the generated models quality. The purpose of applying two different scenarios in the experiments is to discover how well the Infrequent Inductive Miner thresholds can produce predictive models under these two different scenarios circumstances. This paper has been successful in predicting the best-fit model in reference to the SOPs by optimizing the four quality criteria of process discovery using ROC thresholds settings and by using infrequent inductive miner algorithm for models generation, and also in improving the accuracy of models measurements. The accuracy rate of the prediction model from Experiment 1 is 83%, while Experiment 2 yields an accuracy rate of 88%. The most optimal threshold settings to generate the best model in this paper are threshold 0.4 in Experiment 1 and threshold 0.5 in Experiment 2.
AB - Generating process models that reflect close behavioral resemblance to the actual process Standard Operating Procedure (SOP) in process mining can be challenging without taking the four quality criteria of process discovery into account. The four quality criteria, i.e. fitness, precision, generalization, and simplicity, should be well balanced in order to produce proper process models which are aligned to the real-life executions. This paper proposes a method to optimize process discovery quality criteria (PDQC) by implementing different thresholds and analyzing calculation results using Receiver Operating Characteristic (ROC) curve and Infrequent Inductive Miner algorithm. This paper sets up two experiments with different scenarios to measure the calculations of quality criteria and the quality of generated models. The experiments compare two SOPs to the process models discovered by Infrequent Inductive Miner algorithm; hence the SOPs serve as references to determine the generated models quality. The purpose of applying two different scenarios in the experiments is to discover how well the Infrequent Inductive Miner thresholds can produce predictive models under these two different scenarios circumstances. This paper has been successful in predicting the best-fit model in reference to the SOPs by optimizing the four quality criteria of process discovery using ROC thresholds settings and by using infrequent inductive miner algorithm for models generation, and also in improving the accuracy of models measurements. The accuracy rate of the prediction model from Experiment 1 is 83%, while Experiment 2 yields an accuracy rate of 88%. The most optimal threshold settings to generate the best model in this paper are threshold 0.4 in Experiment 1 and threshold 0.5 in Experiment 2.
KW - Infrequent Inductive Miner
KW - Process mining
KW - ROC curve
KW - event log
KW - process discovery
KW - quality criteria
KW - standard operating procedure
UR - http://www.scopus.com/inward/record.url?scp=85107961351&partnerID=8YFLogxK
U2 - 10.1109/APWiMob51111.2021.9435217
DO - 10.1109/APWiMob51111.2021.9435217
M3 - Conference contribution
AN - SCOPUS:85107961351
T3 - Proceedings - 2021 IEEE Asia Pacific Conference on Wireless and Mobile, APWiMob 2021
SP - 45
EP - 51
BT - Proceedings - 2021 IEEE Asia Pacific Conference on Wireless and Mobile, APWiMob 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE Asia Pacific Conference on Wireless and Mobile, APWiMob 2021
Y2 - 8 April 2021 through 9 April 2021
ER -