Faster algorithm of string comparison

  • Qi Xiao Yang*
  • , Sung Sam Yuan
  • , Li Zhao
  • , Lu Chun
  • , Sun Peng
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In many applications, it is necessary to determine the field similarity. Our paper introduces a package of substring-based new algorithms to determine Field Similarity. Combined together, our new algorithms not only achieves higher accuracy, but also gains the time complexity O(knm) (k<0.75) for the worst case, O(β*n) where β<6 for the average case and O(1) for the best case. Throughout the paper, we use the approach of comparative examples to show the higher accuracy of our algorithms compared to that proposed in Lee et al. [1]. Theoretical analysis, concrete examples and experimental results show that our algorithms can significantly improve the accuracy and time complexity of the calculation of field similarity.

Original languageEnglish
Pages (from-to)122-133
Number of pages12
JournalPattern Analysis and Applications
Volume6
Issue number2
DOIs
StatePublished - 2003
Externally publishedYes

Keywords

  • Data cleaning
  • Data mining
  • Field similarity
  • Pattern recognition
  • Record similarity
  • String similarity

Fingerprint

Dive into the research topics of 'Faster algorithm of string comparison'. Together they form a unique fingerprint.

Cite this