Abstract
Searching for health-related information online is becoming more difficult due to the proliferation of multiple-meaning phrases such as semantic words. This study examines the semantic process in medical terms using a collection of doctor's answer texts which requires finding an appropriate model for recognizing text in pairs of comparable Indonesian medical phrases or terminology synonyms. This study contributes to finding an automatic semantic text detection using a word embedding approach to identify text in pairs of similar Indonesian medical terms. We selected 108 pairs of annotated medical terms (Biomedical Named Entity Recognition (Bio-NER)) in Indonesian based on a collection of doctor texts with 60 pairs of similar words and 48 pairs of dissimilar words. Our dataset was processed with the word embedding approach of FastText and BioWordVec. There are two approaches of BioWordVec: with (BioWordVec-2) and without (BioWordVec) translation process. We compared the performance of FastText, BioWordVec, and BioWordVec-2 using measures like accuracy, specificity, and sensitivity. The results show that the BioWordVec-2 model performs better than other models in identifying similar pairs.
| Original language | English |
|---|---|
| Title of host publication | Proceeding - 6th International Conference on Information Technology, Information Systems and Electrical Engineering |
| Subtitle of host publication | Applying Data Sciences and Artificial Intelligence Technologies for Environmental Sustainability, ICITISEE 2022 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 565-569 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798350399615 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 6th International Conference on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2022 - Virtual, Online, Indonesia Duration: 13 Dec 2022 → 14 Dec 2022 |
Publication series
| Name | Proceeding - 6th International Conference on Information Technology, Information Systems and Electrical Engineering: Applying Data Sciences and Artificial Intelligence Technologies for Environmental Sustainability, ICITISEE 2022 |
|---|
Conference
| Conference | 6th International Conference on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2022 |
|---|---|
| Country/Territory | Indonesia |
| City | Virtual, Online |
| Period | 13/12/22 → 14/12/22 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Indonesian Medical Terms
- Question-Answer Data
- Semantic Text
- Word Embedding
Fingerprint
Dive into the research topics of 'Identification Semantic Text of Indonesian Medical Terms from Question-Answer Data'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver