Faster algorithms for string matching with k mismatches

The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length m and every length m substring of the text  T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the patte...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of algorithms Ročník 50; číslo 2; s. 257 - 275
Hlavní autoři: Amir, Amihood, Lewenstein, Moshe, Porat, Ely
Médium: Journal Article Konferenční příspěvek
Jazyk:angličtina
Vydáno: San Diego, CA Elsevier Inc 01.02.2004
Elsevier
Témata:
ISSN:0196-6774, 1090-2678
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length m and every length m substring of the text  T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the pattern has at most k errors (where k is part of the input) in time O( nk). The Abrahamson algorithm finds the number of mismatches at every location in time O(n m logm ) . We present an algorithm that is faster than both. Our algorithm finds all locations where the pattern has at most k errors in time O(n k logk ) . We also show an algorithm that solves the above problem in time O(( n+( nk 3)/ m)log k).
ISSN:0196-6774
1090-2678
DOI:10.1016/S0196-6774(03)00097-X