Optimizing genomic signal extraction of COVID-19 variants using the multi linear predictive coding (M-LPC) method

Thaliah Fauz Ardamayanti*, Ridho Nur Rohman Wijaya, Nurul Hidayat, Mohammad Isa Irawan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) is a novel coronavirus transmitted to humans and causes COVID-19. In some cases, COVID-19 shows symptoms similar to other illnesses, such as influenza, making it difficult to diagnose. We propose a machine learning-based feature extraction method called Multi Linear Predictive Coding (M-LPC) to classify COVID-19 genome signals. M-LPC employs the LPC principle method along with a sliding window technique. This approach utilizes mathematical calculations to generate basic statistical values for identifying features within DNA sequences, including nucleotide frequency, GC distribution, as well as maximum, minimum, mean, and standard deviation. These features can accurately distinguish COVID-19 variants from viruses that have similar symptoms, such as influenza. The advantage of the M-LPC method lies in its ability to be applied to DNA sequences of any length that have been converted into genomic signals, and generate simple features for subsequent machine learning-based classification. Our research shows that M-LPC successfully extracts essential features of the COVID-19 genome signal with an accuracy of 99.86% for three different disease classes and 92.45% for 12 different classes. Therefore, our proposed method with high accuracy can serve as a decision support tool for more accurate diagnosis of COVID-19."

Original languageEnglish
Title of host publicationAIP Conference Proceedings
EditorsBapan Ghosh, Agus Suryanto, Nuning Nuraini, Nur Shofianah
PublisherAmerican Institute of Physics
Edition1
ISBN (Electronic)9780735451520
DOIs
Publication statusPublished - 17 Mar 2025
Event10th International Symposium on Biomathematics, SYMOMATH 2023 - Malang, Indonesia
Duration: 6 Aug 20238 Aug 2023

Publication series

NameAIP Conference Proceedings
Number1
Volume3302
ISSN (Print)0094-243X
ISSN (Electronic)1551-7616

Conference

Conference10th International Symposium on Biomathematics, SYMOMATH 2023
Country/TerritoryIndonesia
CityMalang
Period6/08/238/08/23

Fingerprint

Dive into the research topics of 'Optimizing genomic signal extraction of COVID-19 variants using the multi linear predictive coding (M-LPC) method'. Together they form a unique fingerprint.

Cite this