Comparison of sampling methods for handling imbalance data in deep learning-based predictions of chest X-ray abnormality tags

Hilya Tsaniya*, Chastine Fatichah, Nanik Suciati

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Radiologist detect lung diseases based on abnormality on the chest X-ray images. With the development of computer vision in medical image makes it easier for professional to analyze clinical observation. However, the existing public dataset available suffer from imbalance of abnormality data, and existing solution limits the abnormality into few common diseases. This article trying to explore methods to handling imbalance in chest X-ray abnormality label with grouping minority and majority label based on quartile value. Using Indiana university data with 122 unique labels extracted from patients report using medical indexer, we compare several sampling methods to reduce data imbalance. Sampling method also combined with several neural network classifier model for abnormality tags prediction from X-ray image. From the experiments, Remedial sampling methods got the best result to reduce imbalance with MIR 31 and SCUMBLE 0.10. Remedial also shown best result combined with other classifier averagely with best combination achieved by VGG16 to get the best result for abnormality labels prediction with accuracy 48% and increase in f-1 score 51%, in precision 53%, and in recall 63% than without Remedial sampling.

Original languageEnglish
Title of host publicationICMHI 2023 - 2023 the 7th International Conference on Medical and Health Informatics
PublisherAssociation for Computing Machinery
Pages6-10
Number of pages5
ISBN (Electronic)9798400700712
DOIs
Publication statusPublished - 12 May 2023
Event7th International Conference on Medical and Health Informatics, ICMHI 2023 - Kyoto, Japan
Duration: 12 May 202314 May 2023

Publication series

NameACM International Conference Proceeding Series

Conference

Conference7th International Conference on Medical and Health Informatics, ICMHI 2023
Country/TerritoryJapan
CityKyoto
Period12/05/2314/05/23

Keywords

  • Multi label classification
  • imbalance handling
  • medical image
  • radiograph image

Fingerprint

Dive into the research topics of 'Comparison of sampling methods for handling imbalance data in deep learning-based predictions of chest X-ray abnormality tags'. Together they form a unique fingerprint.

Cite this