Average complexity of backward q-gram string matching algorithms

Many efficient string matching algorithms make use of q-grams and process the text in windows which are read backward. In this paper we provide a framework for analyzing the average case complexity of these algorithms taking into account the statistical dependencies between overlapping q-grams. We a...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Information processing letters Jg. 112; H. 11; S. 433 - 437
1. Verfasser:	Salmela, Leena
Format:	Journal Article
Sprache:	Englisch
Veröffentlicht:	Amsterdam Elsevier B.V 15.06.2012 Elsevier Sequoia S.A
Schlagworte:	Algorithms Analysis of algorithms Average case complexity Complexity Computer science Data processing Mathematical problems Optimization String matching Studies Texts Average case complexity Analysis of algorithms String matching
ISSN:	0020-0190, 1872-6119
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Many efficient string matching algorithms make use of q-grams and process the text in windows which are read backward. In this paper we provide a framework for analyzing the average case complexity of these algorithms taking into account the statistical dependencies between overlapping q-grams. We apply this to the q-gram Boyer–Moore–Horspool algorithm adapted to various string matching problems and show that the algorithm is optimal on average. ► We study the average case complexity of string matching algorithms that use q-grams. ► We take into account the statistical dependencies of overlapping q-grams. ► We show that q-gram Boyer–Moore–Horspool style algorithms are average optimal.
Bibliographie:	SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23
ISSN:	0020-0190 1872-6119
DOI:	10.1016/j.ipl.2012.02.010