TY - GEN
T1 - Extracting Temporal-Based Spatial Features in Imbalanced Data for Predicting Dengue Virus Transmission
AU - Setiyoutami, Arfinda
AU - Anggraeni, Wiwik
AU - Purwitasari, Diana
AU - Yuniarno, Eko Mulyanto
AU - Purnomo, Mauridhi Hery
N1 - Publisher Copyright:
© 2021, Springer Nature Singapore Pte Ltd.
PY - 2021
Y1 - 2021
N2 - Since the movements of mosquito or human can potentially influence dengue virus transmission, recognizing location characteristics defined as spatial factors is necessary for predicting patient status. We proposed feature extraction that considers location characteristics through previous dengue cases and the high possibility of encounters between people with different backgrounds. The number of incoming populations, school buildings and population density was included as the location characteristics. Besides the information of the spatial factors, the number of dengue cases set within a particular time window was specified for virus transmission period. Our experiments obtained two datasets of dengue fever which were patient registry and location characteristics of Malang Regency. Manually recorded Registry Data only contained positive group data and not the negative group when the patients were healthy. Thus, the proposed extraction method also included the process of generating negative data from the existing positive data. Then, we preprocessed the data by cleaning, imputing, encoding, and merging, such that there were four features representing previous dengue cases and eight features describing location characteristics. The experiments demonstrated that by using some ranked features the prediction had a better accuracy of 78.7% compared to using all features. Temporal-based features displayed better performances, but the result was improved in the wider location where people met.
AB - Since the movements of mosquito or human can potentially influence dengue virus transmission, recognizing location characteristics defined as spatial factors is necessary for predicting patient status. We proposed feature extraction that considers location characteristics through previous dengue cases and the high possibility of encounters between people with different backgrounds. The number of incoming populations, school buildings and population density was included as the location characteristics. Besides the information of the spatial factors, the number of dengue cases set within a particular time window was specified for virus transmission period. Our experiments obtained two datasets of dengue fever which were patient registry and location characteristics of Malang Regency. Manually recorded Registry Data only contained positive group data and not the negative group when the patients were healthy. Thus, the proposed extraction method also included the process of generating negative data from the existing positive data. Then, we preprocessed the data by cleaning, imputing, encoding, and merging, such that there were four features representing previous dengue cases and eight features describing location characteristics. The experiments demonstrated that by using some ranked features the prediction had a better accuracy of 78.7% compared to using all features. Temporal-based features displayed better performances, but the result was improved in the wider location where people met.
KW - Imbalanced data
KW - Location characteristic
KW - Predicting dengue virus transmission
KW - Temporal-based spatial features
UR - http://www.scopus.com/inward/record.url?scp=85096492413&partnerID=8YFLogxK
U2 - 10.1007/978-981-15-4409-5_65
DO - 10.1007/978-981-15-4409-5_65
M3 - Conference contribution
AN - SCOPUS:85096492413
SN - 9789811544088
T3 - Advances in Intelligent Systems and Computing
SP - 731
EP - 742
BT - Advances in Computer, Communication and Computational Sciences - Proceedings of IC4S 2019
A2 - Bhatia, Sanjiv K.
A2 - Tiwari, Shailesh
A2 - Ruidan, Su
A2 - Trivedi, Munesh Chandra
A2 - Mishra, K. K.
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Computer, Communication and Computational Sciences, IC4S 2019
Y2 - 11 October 2019 through 12 October 2019
ER -