Relevance Detection using Text Entailment for Health-related Question-Answer Texts with Imbalanced Data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The health consultation website has questions repeated with similar topics. Thus hence high volume the questions were referred to by the users and given answers by the doctor. Moreover, the answers obtained by doctors can give different answers to similar questions. Otherwise, users find it difficult to judge whether the given answers including relevant or not despite the questions referred to being similar. The research goal is to use the finetuning IndoBERT-QA model for entailment detection and then yield the prediction entailment, contradiction, and neutral to judge the answer based on referred to the question. Strength finetuning the IndoBERT-QA model can manage the word out-of-vocabulary in the Indonesian language and can learn the context from word sequence or the context of a whole sentence that is unknown complexly in the corpus that has a large size. Otherwise BERT model is unable to identify the semantic correlation between text questions and answers using the Indonesian language despite having different syntactic. The experiment result for comparison finetuning IndoBERT-QA model achieved more better accuracy is 0,91 than the BERT model only yielded an accuracy of 0,87. Effectively finetuning the IndoBERT-QA model can classify the class label entailment, contradiction, and neutral than the BERT model. Thus hence the BERT model achieves a small recognize the anatomic structure and medical terms specifically that are important for humans.

Original languageEnglish
Title of host publication2024 International Seminar on Intelligent Technology and Its Applications
Subtitle of host publicationCollaborative Innovation: A Bridging from Academia to Industry towards Sustainable Strategic Partnership, ISITIA 2024 - Proceeding
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages681-686
Number of pages6
Edition2024
ISBN (Electronic)9798350378573
DOIs
Publication statusPublished - 2024
Event25th International Seminar on Intelligent Technology and Its Applications, ISITIA 2024 - Hybrid, Mataram, Indonesia
Duration: 10 Jul 202412 Jul 2024

Conference

Conference25th International Seminar on Intelligent Technology and Its Applications, ISITIA 2024
Country/TerritoryIndonesia
CityHybrid, Mataram
Period10/07/2412/07/24

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • BERT
  • IndoBERT
  • deep learning
  • question-answer dataset
  • relevant detection

Fingerprint

Dive into the research topics of 'Relevance Detection using Text Entailment for Health-related Question-Answer Texts with Imbalanced Data'. Together they form a unique fingerprint.

Cite this