Faster algorithms for string matching with k mismatches
The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length m and every length m substring of the text T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the patte...
Uloženo v:
| Vydáno v: | Journal of algorithms Ročník 50; číslo 2; s. 257 - 275 |
|---|---|
| Hlavní autoři: | , , |
| Médium: | Journal Article Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
San Diego, CA
Elsevier Inc
01.02.2004
Elsevier |
| Témata: | |
| ISSN: | 0196-6774, 1090-2678 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | The
string matching with mismatches problem is that of finding the number of mismatches between a pattern
P of length
m and every length
m substring of the text
T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the pattern has at most
k errors (where
k is part of the input) in time
O(
nk). The Abrahamson algorithm finds the number of mismatches at every location in time
O(n
m
logm
)
. We present an algorithm that is faster than both. Our algorithm finds all locations where the pattern has at most
k errors in time
O(n
k
logk
)
. We also show an algorithm that solves the above problem in time
O((
n+(
nk
3)/
m)log
k). |
|---|---|
| ISSN: | 0196-6774 1090-2678 |
| DOI: | 10.1016/S0196-6774(03)00097-X |