Faster algorithms for string matching with k mismatches

The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length m and every length m substring of the text  T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the patte...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of algorithms Jg. 50; H. 2; S. 257 - 275
Hauptverfasser: Amir, Amihood, Lewenstein, Moshe, Porat, Ely
Format: Journal Article Tagungsbericht
Sprache:Englisch
Veröffentlicht: San Diego, CA Elsevier Inc 01.02.2004
Elsevier
Schlagworte:
ISSN:0196-6774, 1090-2678
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length m and every length m substring of the text  T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the pattern has at most k errors (where k is part of the input) in time O( nk). The Abrahamson algorithm finds the number of mismatches at every location in time O(n m logm ) . We present an algorithm that is faster than both. Our algorithm finds all locations where the pattern has at most k errors in time O(n k logk ) . We also show an algorithm that solves the above problem in time O(( n+( nk 3)/ m)log k).
ISSN:0196-6774
1090-2678
DOI:10.1016/S0196-6774(03)00097-X