Faster computation of left-bounded shortest unique substrings

Bibliographic details
Published in: Algorithms for Molecular Biology, volume 20; issue 1; p. 11 - 7
Main authors: Aguiar, Larissa L. M.; Louza, Felipe A.
Format: Journal Article
Language: English
Publication details: London BioMed Central 20.06.2025
BioMed Central Ltd
Springer Nature B.V
BMC
ISSN: 1748-7188
Description
Summary: Finding shortest unique substrings (SUS) is a fundamental problem in string processing with applications in bioinformatics. In this paper, we present an algorithm for solving a variant of the SUS problem, the left-bounded shortest unique substrings (LSUS). This variant is particularly important in applications such as PCR primer design. Our algorithm runs in O(n) time using 2n memory words plus n bytes for an input string of length n. Experimental results with real and artificial datasets show that our algorithm is the fastest alternative in practice, being two times faster on average than related works, while using a similar peak memory footprint.
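To make the problem statement concrete, the following is a minimal brute-force sketch of what an LSUS query computes. It assumes the usual definition from the SUS literature: for each start position i, find the shortest substring beginning at i that occurs exactly once in the input. It is not the O(n) algorithm described in the paper; the function names `count_occurrences` and `naive_lsus` are hypothetical, and the running time is far worse than linear.

```python
def count_occurrences(text: str, pattern: str) -> int:
    """Count occurrences of pattern in text, including overlapping ones."""
    count = 0
    start = text.find(pattern)
    while start != -1:
        count += 1
        # Advance by one position so overlapping matches are also counted.
        start = text.find(pattern, start + 1)
    return count


def naive_lsus(text: str) -> list[int]:
    """Return, for each start position i, the length of the shortest
    substring text[i:i+k] that occurs exactly once in text.

    A value of 0 means no unique substring starts at position i.
    Illustrative brute force only, not the paper's linear-time method.
    """
    n = len(text)
    lengths = []
    for i in range(n):
        found = 0
        # Try candidate lengths in increasing order; the first unique
        # candidate is the shortest one starting at i.
        for k in range(1, n - i + 1):
            if count_occurrences(text, text[i:i + k]) == 1:
                found = k
                break
        lengths.append(found)
    return lengths


if __name__ == "__main__":
    print(naive_lsus("banana"))
```

For the input "banana", position 0 yields length 1 ("b"), position 1 yields length 4 ("anan"), and position 2 yields length 3 ("nan"). The last three positions have no unique substring starting there, because every suffix from those positions also occurs earlier in the string.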
DOI: 10.1186/s13015-025-00287-5