Text Classification based on Sentence Features and Preprocessing Settings for Labeling eHealth Consultation Answer

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study identifies key aspects of doctor- patient communication by analyzing doctor responses from eHealth Consultation Answer data on the Online Healthcare Consultations (OHC) site, AloDokter. The analysis is performed through topic segmentation to help users better understand the doctors' explanations. Previously, researchers attempted topic segmentation on the same dataset using unsupervised learning and clustering-based text segmentation methods, but the results were suboptimal. To address this issue, we propose a solution that involves segmenting the text using text classification techniques, specifically by leveraging sentence features and refined customized preprocessing. Sentence features are enhanced by combining word vectors with sentence embeddings. The preprocessing improvements involve optimizing the tokenization process and refining stopword handling, focusing on encountered in doctor-patient communication. This research aims to compare several model text classification methods based on sentence features and preprocessing configurations and evaluate their performance. Additionally, we present the performance of text classification methods without including sentence features or customized preprocessing for comparison. Our experiments demonstrate that The Multi-Layer Perceptron (MLP) models with sentence features achieve an average F1-score of 91%, outperforming other text classification models.

Original languageEnglish
Title of host publication7th International Seminar on Research of Information Technology and Intelligent Systems
Subtitle of host publicationAdvanced Intelligent Systems in Contemporary Society, ISRITI 2024 - Proceedings
EditorsFerry Wahyu Wibowo
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1065-1070
Number of pages6
ISBN (Electronic)9798331519643
DOIs
Publication statusPublished - 2024
Event7th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2024 - Hybird, Yogyakarta, Indonesia
Duration: 11 Dec 2024 → …

Publication series

Name7th International Seminar on Research of Information Technology and Intelligent Systems: Advanced Intelligent Systems in Contemporary Society, ISRITI 2024 - Proceedings

Conference

Conference7th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2024
Country/TerritoryIndonesia
CityHybird, Yogyakarta
Period11/12/24 → …

Keywords

  • Multi- Layer Perceptron
  • Sentence Features
  • Text Classification
  • Text Segmentation
  • eHealth consultation Answer

Fingerprint

Dive into the research topics of 'Text Classification based on Sentence Features and Preprocessing Settings for Labeling eHealth Consultation Answer'. Together they form a unique fingerprint.

Cite this