Deep Learning Deployment on Big Data Infrastructure Using Apache Spark (Case Study: COVID-19 Detection Using X-Ray Images)

Abdul Munif*, Hendra Ramadhani

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

It is possible to use GPU (Graphic Processing Unit) to increase deep learning performance. This requires us to invest in separate GPUs, which can be relatively expensive. However, if we already have big data infrastructures, it is possible to deploy deep learning on top of them. We utilize the BigDL library on the Apache Spark cluster to run deep learning tasks. BigDL is different from traditional deep learning as it implements distributed and parallel processing. This allows for horizontal scaling of workers using BigDL, resulting in faster training times. Simulation testing on the Apache Spark cluster can use deep learning applications with the transfer learning method, leveraging pre-existing models such as InceptionVl. Deep learning can be developed using the BigDL framework. We use a case study of medical image classification for COVID19 detection. Based on the experiments, the deployment model using BigDL on the Apache Spark infrastructure achieved an average accuracy of 92%, and the average running time is 2 hours, 23 minutes, and 28 seconds.

Original languageEnglish
Title of host publication2023 14th International Conference on Information and Communication Technology and System, ICTS 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages159-163
Number of pages5
ISBN (Electronic)9798350312164
DOIs
Publication statusPublished - 2023
Event14th International Conference on Information and Communication Technology and System, ICTS 2023 - Surabaya, Indonesia
Duration: 4 Oct 20235 Oct 2023

Publication series

Name2023 14th International Conference on Information and Communication Technology and System, ICTS 2023

Conference

Conference14th International Conference on Information and Communication Technology and System, ICTS 2023
Country/TerritoryIndonesia
CitySurabaya
Period4/10/235/10/23

Keywords

  • Apache Spark
  • Big Data
  • COVID19
  • Classification
  • Deep Learning

Fingerprint

Dive into the research topics of 'Deep Learning Deployment on Big Data Infrastructure Using Apache Spark (Case Study: COVID-19 Detection Using X-Ray Images)'. Together they form a unique fingerprint.

Cite this