Abstract
Electronic Medical Record (EMR) is an important element of information technology in healthcare sector. EMR is an electronic record containing health-related information on patients that can be created and managed by authorized physician and staff in a healthcare service organization. EMR is a framework for determining diagnosis and treatment. EMR has free text and unstructured format which makes it more difficult to extract the hidden information as a decision support system. This study performs classification from Indonesian EMR for clinical decision support system (CDSS) in classifying patient diagnosis using Term Frequency-Inverse Document Frequency (TF-IDF) for feature extraction and Support Vector Machine (SVM) for classifier method. SVM is a powerful algorithm in high-dimensional data such as in textual data processing. The focus diagnoses classified in this paper are tuberculosis, cancer, diabetes mellitus, hypertension, and chronic kidney which have high prevalence rates in Indonesia. The model is built by considering the kernel function and the use of stopword removal or without stopword removal. The result showed that TF - IDF and SVM method could be used effectively to predict diagnosis with stop word removal. Classification performance increased with stopword removal on all SVM kernels with accuracy in linear kernel 89.91 %, polynomial kernel 90.58%, RBF kernel 90.75%, and sigmoid kernel 91.03%..
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2021 International Seminar on Application for Technology of Information and Communication |
| Subtitle of host publication | IT Opportunities and Creativities for Digital Innovation and Communication within Global Pandemic, iSemantic 2021 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 243-248 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781665428040 |
| DOIs | |
| Publication status | Published - 18 Sept 2021 |
| Event | 2021 International Seminar on Application for Technology of Information and Communication, iSemantic 2021 - Semarang, Indonesia Duration: 18 Sept 2021 → 19 Sept 2021 |
Publication series
| Name | Proceedings - 2021 International Seminar on Application for Technology of Information and Communication: IT Opportunities and Creativities for Digital Innovation and Communication within Global Pandemic, iSemantic 2021 |
|---|
Conference
| Conference | 2021 International Seminar on Application for Technology of Information and Communication, iSemantic 2021 |
|---|---|
| Country/Territory | Indonesia |
| City | Semarang |
| Period | 18/09/21 → 19/09/21 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Electronic Medical Record
- Support Vector Machine
- Text Mining
Fingerprint
Dive into the research topics of 'Patient Diagnosis Classification based on Electronic Medical Record using Text Mining and Support Vector Machine'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver