Edit distance weighting modification using phonetic and typographic letter grouping over homomorphic encrypted data

Tohari Ahmad, Kukuh Indrayana, Waskitho Wibisono, Royyana M. Ijtihadie

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

The Edit Distance string matching algorithm gives the same weight to every mismatching character. In practice, a mismatch can be caused by a phonetic error, a typing error, or an unknown error. Editex improves on Edit Distance by modifying the algorithm, but it tolerates only phonetic errors. In this paper, we improve its performance by proposing a new weighting and distance calculation. The sources of mismatch are grouped into phonetic and typographic errors: characters are divided into phonetic groups and typographic groups, and each group has its own weight. Because it relies on this letter grouping, the proposed method is also suitable for implementation over homomorphically encrypted data. Experimental results show that the method produces lower false positive rates than the Edit Distance and Editex algorithms. The proposed method generates 2.2 false positives per experiment, while Edit Distance and Editex produce 8.24 and 3.12, respectively. It can be inferred that the proposed method achieves a relatively low error rate.
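The weighting idea in the abstract can be illustrated with a minimal sketch of a group-weighted edit distance on plaintext. The abstract does not give the paper's actual letter groups, weights, or distance formula, so everything specific here is an assumption: the phonetic groups are Editex-style groups used for illustration, the typographic rule (horizontally adjacent keys on a QWERTY row) is hypothetical, and the weights 0.5, 0.75, and 1.0 are placeholders. The sketch does not attempt the homomorphic-encryption setting.

```python
# Illustrative only: groups and weights are assumptions, not the paper's values.
PHONETIC_GROUPS = ["aeiouy", "bp", "ckq", "dt", "lr", "mn", "gj", "fpv", "sxz", "csz"]
QWERTY_ROWS = ["qwertyuiop", "asdfghjkl", "zxcvbnm"]


def same_phonetic_group(a, b):
    """True if both letters appear together in at least one phonetic group."""
    return any(a in group and b in group for group in PHONETIC_GROUPS)


def adjacent_keys(a, b):
    """True if the two letters are horizontal neighbours on the same QWERTY row."""
    for row in QWERTY_ROWS:
        i, j = row.find(a), row.find(b)
        if i >= 0 and j >= 0:
            return abs(i - j) == 1
    return False


def substitution_cost(a, b, phonetic_w=0.5, typo_w=0.75, other_w=1.0):
    """Cost of replacing a with b, reduced when the pair shares a group."""
    if a == b:
        return 0.0
    if same_phonetic_group(a, b):
        return phonetic_w
    if adjacent_keys(a, b):
        return typo_w
    return other_w


def weighted_edit_distance(s, t, indel_w=1.0):
    """Standard edit-distance dynamic program with group-dependent substitution costs."""
    s, t = s.lower(), t.lower()
    prev = [j * indel_w for j in range(len(t) + 1)]
    for i, a in enumerate(s, 1):
        curr = [i * indel_w]
        for j, b in enumerate(t, 1):
            curr.append(min(
                prev[j] + indel_w,                       # delete a
                curr[j - 1] + indel_w,                   # insert b
                prev[j - 1] + substitution_cost(a, b),   # substitute or match
            ))
        prev = curr
    return prev[-1]


print(weighted_edit_distance("smith", "smyth"))  # phonetic substitution (i/y)
print(weighted_edit_distance("cat", "cst"))      # adjacent-key substitution (a/s)
print(weighted_edit_distance("cat", "cxt"))      # unrelated substitution
```

With uniform weights this reduces to the ordinary Edit Distance; lowering the cost only for pairs in the same group is what lets a matching threshold separate likely misspellings from unrelated strings.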

Original language: English
Title of host publication: Proceeding - 2017 3rd International Conference on Science in Information Technology
Subtitle of host publication: Theory and Application of IT for Education, Industry and Society in Big Data Era, ICSITech 2017
Editors: Lala Septem Riza, Andri Pranolo, Aji Prasetyo Wibawa, Enjun Junaeti, Yaya Wihardi, Ummi Raba'ah Hashim, Shi-Jinn Horng, Rafal Drezewski, Heui Seok Lim, Goutam Chakraborty, Leonel Hernandez, Shah Nazir
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 408-412
Number of pages: 5
ISBN (Electronic): 9781509058662
Publication status: Published - 1 Jul 2017
Event: 3rd International Conference on Science in Information Technology, ICSITech 2017 - Bandung, Indonesia
Duration: 25 Oct 2017 to 26 Oct 2017

Publication series

Name: Proceeding - 2017 3rd International Conference on Science in Information Technology: Theory and Application of IT for Education, Industry and Society in Big Data Era, ICSITech 2017
Volume: 2018-January

Conference

Conference: 3rd International Conference on Science in Information Technology, ICSITech 2017
Country/Territory: Indonesia
City: Bandung
Period: 25/10/17 to 26/10/17

Keywords

  • edit distance
  • homomorphic encryption
  • information security
  • string matching
