Abstract

String matching methods are often used to find out DNA pattern. However, basic string matching methods are unable to recognize the mutations case of viruses and bacteria. Distance-based Hamming method can accept character mismatches in an arrangement although it can give varied performance results depending on the number of compared patterns. We modify Hamming method to do pattern analysis of nucleotide arrangement in DNA that has primary Hepatitis C Virus (HCV) infection. We select HCV analysis because Indonesia showed the highest hepatitis case in Southeast Asia. Our experiments use DNA Hepatitis data from World Gen Bank and make comparisons to primary sequences from our partner institution. The problem we encountered while researching is the length of the HCV primary characters that are not always the same. This raises the hamming counting score to become unbalanced. The system we propose is to normalize the primary before being tested on isolate. The result of the normalization will be a constant and then summed with the hamming count. So the results of each hamming primary with each isolate can be balanced. The test results show that hamming method with modification able to give the distance between isolate and primary. The analysis of pattern matching results is similar to the condition of real primary. We purpose this modified hamming distance for analize virus or bacteria mutation, especially on HCV primary.

Original languageEnglish
Title of host publicationProceedings - 2017 2nd International Conferences on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages310-314
Number of pages5
ISBN (Electronic)9781538606582
DOIs
Publication statusPublished - 2 Jul 2017
Event2nd International Conferences on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2017 - Yogyakarta, Indonesia
Duration: 1 Nov 20172 Nov 2017

Publication series

NameProceedings - 2017 2nd International Conferences on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2017
Volume2018-January

Conference

Conference2nd International Conferences on Information Technology, Information Systems and Electrical Engineering, ICITISEE 2017
Country/TerritoryIndonesia
CityYogyakarta
Period1/11/172/11/17

Keywords

  • DNA HCV
  • data mining
  • hamming
  • pattern matching

Fingerprint

Dive into the research topics of 'Distance-based pattern matching of DNA sequences for evaluating primary mutation'. Together they form a unique fingerprint.

Cite this