TY - GEN
T1 - Creating A Patient Data Redundancy Detection Model using Deep Learning Methods
AU - Widodo, Prevandito Wahyu
AU - Rakhmawati, Nur Aini
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Redundancy of patient data in healthcare facilities often arises because of double entry. Although Standard Operating Procedures (SOPs) have been enforced, patient conditions and situations during input tend to dominate input results. Entity matching is a potential solution for overcoming this problem. However, with the growth in data volume and the complexity of relationships between entities, entity matching often faces constraints related to the quality of results, time efficiency, and resources. As a solution, deep learning offers an alternative approach to entity matching, especially in situations where the data have a large size and a wide variety of entity representations, such as patient data. This study aimed to improve the validity of patient data by applying deep learning to entity matching. Four methods were planned for testing: SIF, RNN, Attention, and Hybrid. Although previous research has shown that the hybrid method significantly outperforms the other three methods in contexts with similar datasets, our findings demonstrate that the RNN method is more precise. From a dataset of 17,075 entries, we successfully identified 199 matching data pairs (positive labels), highlighting a significant initial imbalance. We employed an undersampling method to manage this imbalance. The results were promising, with an F1-score of 98.87.
AB - Redundancy of patient data in healthcare facilities often arises because of double entry. Although Standard Operating Procedures (SOPs) have been enforced, patient conditions and situations during input tend to dominate input results. Entity matching is a potential solution for overcoming this problem. However, with the growth in data volume and the complexity of relationships between entities, entity matching often faces constraints related to the quality of results, time efficiency, and resources. As a solution, deep learning offers an alternative approach to entity matching, especially in situations where the data have a large size and a wide variety of entity representations, such as patient data. This study aimed to improve the validity of patient data by applying deep learning to entity matching. Four methods were planned for testing: SIF, RNN, Attention, and Hybrid. Although previous research has shown that the hybrid method significantly outperforms the other three methods in contexts with similar datasets, our findings demonstrate that the RNN method is more precise. From a dataset of 17,075 entries, we successfully identified 199 matching data pairs (positive labels), highlighting a significant initial imbalance. We employed an undersampling method to manage this imbalance. The results were promising, with an F1-score of 98.87.
KW - Data Validity
KW - Deep Learning
KW - Entity Matching
KW - Redundancy Data
KW - patient
UR - http://www.scopus.com/inward/record.url?scp=85202795711&partnerID=8YFLogxK
U2 - 10.1109/ICICoS62600.2024.10636911
DO - 10.1109/ICICoS62600.2024.10636911
M3 - Conference contribution
AN - SCOPUS:85202795711
T3 - Proceedings - International Conference on Informatics and Computational Sciences
SP - 262
EP - 266
BT - 2024 7th International Conference on Informatics and Computational Sciences, ICICoS 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Informatics and Computational Sciences, ICICoS 2024
Y2 - 17 July 2024 through 18 July 2024
ER -