Faster algorithms for string matching with k mismatches
The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length m and every length m substring of the text T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the patte...
Gespeichert in:
| Veröffentlicht in: | Journal of algorithms Jg. 50; H. 2; S. 257 - 275 |
|---|---|
| Hauptverfasser: | , , |
| Format: | Journal Article Tagungsbericht |
| Sprache: | Englisch |
| Veröffentlicht: |
San Diego, CA
Elsevier Inc
01.02.2004
Elsevier |
| Schlagworte: | |
| ISSN: | 0196-6774, 1090-2678 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | The
string matching with mismatches problem is that of finding the number of mismatches between a pattern
P of length
m and every length
m substring of the text
T. Currently, the fastest algorithms for this problem are the following. The Galil–Giancarlo algorithm finds all locations where the pattern has at most
k errors (where
k is part of the input) in time
O(
nk). The Abrahamson algorithm finds the number of mismatches at every location in time
O(n
m
logm
)
. We present an algorithm that is faster than both. Our algorithm finds all locations where the pattern has at most
k errors in time
O(n
k
logk
)
. We also show an algorithm that solves the above problem in time
O((
n+(
nk
3)/
m)log
k). |
|---|---|
| ISSN: | 0196-6774 1090-2678 |
| DOI: | 10.1016/S0196-6774(03)00097-X |