Load Characterization of AI Applications using DQoES Scheduler for Serving Multiple Requests

Taufiq Odhi Dwi Putra, Royyana Muslim Ijtihadie, Tohari Ahmad

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Many kinds of Artificial Intelligence (AI)-based applications are now being developed to meet a variety of needs, for example counting objects recorded by a camera, identifying diseases from MRI images, and predicting traffic congestion levels at specific times. One way to provision infrastructure resources that match the workload of AI-based applications is to understand the patterns, or characteristics, of those workloads. Because an AI model runs on a Graphics Processing Unit (GPU), several parts of the model's architecture use Video Random Access Memory (VRAM) as temporary storage to reduce running time. This paper analyzes the workload characteristics of AI-based applications in terms of running time and VRAM usage. Experiments are conducted under two request scenarios, sequential and concurrent, using four variants of a Super Resolution Generative Adversarial Network (SRGAN) model: no prune, random unstructured, L1 norm, and L2 norm. The experimental results show that the workloads of all four SRGAN variants are almost the same in the sequential scenario, whereas in the concurrent scenario the four variants have different workloads. Some models are processed more effectively one request at a time than several at once, for example the SRGAN no prune model, while others are processed more effectively several at once than one at a time, for example the SRGAN random unstructured and L2 norm models.
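The two request scenarios can be illustrated with a minimal sketch. It is not the authors' experimental harness and it does not model the DQoES scheduler: `fake_inference` is a hypothetical stand-in that sleeps instead of running an SRGAN forward pass, and the function names and default values are assumptions made for illustration. On a real GPU, peak VRAM per run could additionally be read with a call such as PyTorch's `torch.cuda.max_memory_allocated()`, which is left out here so that the sketch runs with the standard library alone.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_inference(duration_s: float) -> float:
    """Hypothetical stand-in for one SRGAN request; returns its own latency."""
    start = time.perf_counter()
    time.sleep(duration_s)  # placeholder for the real GPU forward pass
    return time.perf_counter() - start


def run_sequential(n_requests: int, duration_s: float = 0.05) -> dict:
    """Serve requests one at a time; record wall-clock and per-request time."""
    start = time.perf_counter()
    latencies = [fake_inference(duration_s) for _ in range(n_requests)]
    return {"wall_s": time.perf_counter() - start, "latencies": latencies}


def run_concurrent(n_requests: int, workers: int, duration_s: float = 0.05) -> dict:
    """Submit all requests at once to a pool of `workers` threads."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(fake_inference, duration_s) for _ in range(n_requests)]
        latencies = [f.result() for f in futures]
    return {"wall_s": time.perf_counter() - start, "latencies": latencies}


if __name__ == "__main__":
    seq = run_sequential(4)
    con = run_concurrent(4, workers=4)
    print(f"sequential wall time: {seq['wall_s']:.3f} s")
    print(f"concurrent wall time: {con['wall_s']:.3f} s")
```

With a sleep-based stand-in the concurrent run always finishes sooner. A real model contends for GPU compute and VRAM, which is why the abstract reports that some variants are served more effectively one request at a time.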

Original language: English
Title of host publication: 12th International Symposium on Digital Forensics and Security, ISDFS 2024
Editors: Asaf Varol, Murat Karabatak, Cihan Varol, Eva Tuba
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350330366
Publication status: Published - 2024
Event: 12th International Symposium on Digital Forensics and Security, ISDFS 2024 - San Antonio, United States
Duration: 29 Apr 2024 → 30 Apr 2024

Publication series

Name: 12th International Symposium on Digital Forensics and Security, ISDFS 2024

Conference

Conference: 12th International Symposium on Digital Forensics and Security, ISDFS 2024
Country/Territory: United States
City: San Antonio
Period: 29/04/24 → 30/04/24

Keywords

  • Application
  • Artificial Intelligence
  • Load Characterization
  • Multiple Requests
  • Scheduler
