TY - GEN
T1 - Identifying Precautionary Measures for High-Risk Disease from Doctor's Answer Text Using LDA
AU - Juanita, Safitri
AU - Purwitasari, DIana
AU - Purnama, I. Ketut Eddy
AU - Purnomo, Mauridhi Hery
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/9/29
Y1 - 2021/9/29
N2 - The Online Health Consultation (OHC), which contains a QA collection of various diseases since 2014, has received an increasing number of visits due to the COVID-19. Based on the benefits and increasing health information need for people who seek information in OHC, health information related to precautionary measures to avoid diseases, especially high-risk diseases, become critical because not all seeker and readers of health information are diagnosed with certain diseases. However, It has currently unidentified whether the text of the doctor's answer corpus, especially in high-risk diseases, contains words that imply precautionary. This study aims to find the pattern of doctor's answer for high-risk diseases through the corpus of doctor's answer text on OHC by identifying whether the doctor's answer text contains words that imply precautionary against disease. Thus, it can help health information seekers and readers take precautionary against disease early on. This paper's contribution was to identify precautionary measures from doctor's answer text for high-risk disease in 2014-2021 using the best model of the two models, namely Single LDA (only LDA Method) and Hybrid LDA (a combination of LDA and Collapsed Gibbs Sampling). The results showed that the best model was Hybrid LDA, and medical experts identified groups of words with this model into four domains, namely symptoms/diagnosis, treatments, precautionary measurements, and general text. The pattern that emerges from the identification of precautionary measures shows (1) which precautionary measures are divided based on what disease, (2) Some words that mean precautionary measures also mean treatment or symptom/diagnosis.
AB - The Online Health Consultation (OHC), which contains a QA collection of various diseases since 2014, has received an increasing number of visits due to the COVID-19. Based on the benefits and increasing health information need for people who seek information in OHC, health information related to precautionary measures to avoid diseases, especially high-risk diseases, become critical because not all seeker and readers of health information are diagnosed with certain diseases. However, It has currently unidentified whether the text of the doctor's answer corpus, especially in high-risk diseases, contains words that imply precautionary. This study aims to find the pattern of doctor's answer for high-risk diseases through the corpus of doctor's answer text on OHC by identifying whether the doctor's answer text contains words that imply precautionary against disease. Thus, it can help health information seekers and readers take precautionary against disease early on. This paper's contribution was to identify precautionary measures from doctor's answer text for high-risk disease in 2014-2021 using the best model of the two models, namely Single LDA (only LDA Method) and Hybrid LDA (a combination of LDA and Collapsed Gibbs Sampling). The results showed that the best model was Hybrid LDA, and medical experts identified groups of words with this model into four domains, namely symptoms/diagnosis, treatments, precautionary measurements, and general text. The pattern that emerges from the identification of precautionary measures shows (1) which precautionary measures are divided based on what disease, (2) Some words that mean precautionary measures also mean treatment or symptom/diagnosis.
KW - Collapsed Gibbs Sampling
KW - Doctor's Answer Text
KW - LDA Model
KW - Precautionary Measures
KW - Topic Modeling
UR - http://www.scopus.com/inward/record.url?scp=85119970072&partnerID=8YFLogxK
U2 - 10.1109/IES53407.2021.9593997
DO - 10.1109/IES53407.2021.9593997
M3 - Conference contribution
AN - SCOPUS:85119970072
T3 - International Electronics Symposium 2021: Wireless Technologies and Intelligent Systems for Better Human Lives, IES 2021 - Proceedings
SP - 41
EP - 46
BT - International Electronics Symposium 2021
A2 - Yunanto, Andhik Ampuh
A2 - Kusuma N, Artiarini
A2 - Hermawan, Hendhi
A2 - Putra, Putu Agus Mahadi
A2 - Gamar, Farida
A2 - Ridwan, Mohamad
A2 - Prayogi, Yanuar Risah
A2 - Ruswiansari, Maretha
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 23rd International Electronics Symposium, IES 2021
Y2 - 29 September 2021 through 30 September 2021
ER -