Elastic-Degenerate String Matching with 1 Error or Mismatch
An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N , introduced to represent a set of related DNA sequences, also known as a pangenome . The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The...
Saved in:
| Published in: | Theory of computing systems Vol. 68; no. 5; pp. 1442 - 1467 |
|---|---|
| Main Authors: | , , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
New York
Springer US
01.10.2024
Springer Nature B.V Springer Verlag |
| Subjects: | |
| ISSN: | 1432-4350, 1433-0490 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | An elastic-degenerate (ED) string is a sequence of
n
finite sets of strings of total length
N
, introduced to represent a set of related DNA sequences, also known as a
pangenome
. The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length
m
in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an
O
~
(
n
m
ω
-
1
)
+
O
(
N
)
-time algorithm [Bernardini et al., SIAM J. Comput. 2022], where
ω
denotes the matrix multiplication exponent and the
O
~
(
·
)
notation suppresses polylog factors. In the
k
-EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most
k
errors.
k
-EDSM can be solved in
O
(
k
2
m
G
+
k
N
)
time, under edit distance, or
O
(
k
m
G
+
k
N
)
time, under Hamming distance, where
G
denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately,
G
is only bounded by
N
, and so even for
k
=
1
, the existing algorithms run in
Ω
(
m
N
)
time in the worst case. In this paper we make progress in this direction. We show that 1-EDSM can be solved in
O
(
(
n
m
2
+
N
)
log
m
)
or
O
(
n
m
3
+
N
)
time under edit distance. For the decision version of the problem, we present a faster
O
(
n
m
2
log
m
+
N
log
log
m
)
-time algorithm. We also show that 1-EDSM can be solved in
O
(
n
m
2
+
N
log
m
)
time under Hamming distance. Our algorithms for edit distance rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or 2d range emptiness), which we show how to solve efficiently. In order to obtain an even faster algorithm for Hamming distance, we rely on employing and adapting the
k
-errata trees for indexing with errors [Cole et al., STOC 2004]. This is an extended version of a paper presented at LATIN 2022. |
|---|---|
| AbstractList | An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N , introduced to represent a set of related DNA sequences, also known as a pangenome. The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an Õ(nm ω-1 ) + O(N )-time algorithm [Bernardini et al., SIAM J. Comput. 2022], where ω denotes the matrix multiplication exponent and the Õ(•) notation suppresses polylog factors. In the k-EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most k errors. k-EDSM can be solved in O(k 2 mG + k N ) time, under edit distance, or O(kmG + k N ) time, under Hamming distance, where G denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately, G is only bounded by N , and so even for k = 1, the existing algorithms run in Ω(m N ) time in the worst case. In this paper we make progress in this direction. We show that 1-EDSM can be solved in O((nm 2 + N ) log m) or O(nm 3 + N ) time under edit distance. For the decision version of the problem, we present a faster O(nm 2 √ log m + N log log m)-time algorithm. We also show that 1-EDSM can be solved in O(nm 2 + N log m) time under Hamming distance. Our algorithms for edit distance rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or 2d range emptiness), which we show how to solve efficiently. In order to obtain an even faster algorithm for Hamming distance, we rely on employing and adapting the k-errata trees for indexing with errors [Cole et al., STOC 2004]. An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N, introduced to represent a set of related DNA sequences, also known as a pangenome. The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an O~(nmω-1)+O(N)-time algorithm [Bernardini et al., SIAM J. Comput. 2022], where ω denotes the matrix multiplication exponent and the O~(·) notation suppresses polylog factors. In the k-EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most k errors. k-EDSM can be solved in O(k2mG+kN) time, under edit distance, or O(kmG+kN) time, under Hamming distance, where G denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately, G is only bounded by N, and so even for k=1, the existing algorithms run in Ω(mN) time in the worst case. In this paper we make progress in this direction. We show that 1-EDSM can be solved in O((nm2+N)logm) or O(nm3+N) time under edit distance. For the decision version of the problem, we present a faster O(nm2logm+Nloglogm)-time algorithm. We also show that 1-EDSM can be solved in O(nm2+Nlogm) time under Hamming distance. Our algorithms for edit distance rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or 2d range emptiness), which we show how to solve efficiently. In order to obtain an even faster algorithm for Hamming distance, we rely on employing and adapting the k-errata trees for indexing with errors [Cole et al., STOC 2004]. This is an extended version of a paper presented at LATIN 2022. An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N , introduced to represent a set of related DNA sequences, also known as a pangenome . The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an O ~ ( n m ω - 1 ) + O ( N ) -time algorithm [Bernardini et al., SIAM J. Comput. 2022], where ω denotes the matrix multiplication exponent and the O ~ ( · ) notation suppresses polylog factors. In the k -EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most k errors. k -EDSM can be solved in O ( k 2 m G + k N ) time, under edit distance, or O ( k m G + k N ) time, under Hamming distance, where G denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately, G is only bounded by N , and so even for k = 1 , the existing algorithms run in Ω ( m N ) time in the worst case. In this paper we make progress in this direction. We show that 1-EDSM can be solved in O ( ( n m 2 + N ) log m ) or O ( n m 3 + N ) time under edit distance. For the decision version of the problem, we present a faster O ( n m 2 log m + N log log m ) -time algorithm. We also show that 1-EDSM can be solved in O ( n m 2 + N log m ) time under Hamming distance. Our algorithms for edit distance rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or 2d range emptiness), which we show how to solve efficiently. In order to obtain an even faster algorithm for Hamming distance, we rely on employing and adapting the k -errata trees for indexing with errors [Cole et al., STOC 2004]. This is an extended version of a paper presented at LATIN 2022. An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N , introduced to represent a set of related DNA sequences, also known as a pangenome . The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an $$\mathcal {\tilde{O}}(nm^{\omega -1})+\mathcal {O}(N)$$ O ~ ( n m ω - 1 ) + O ( N ) -time algorithm [Bernardini et al., SIAM J. Comput. 2022], where $$\omega $$ ω denotes the matrix multiplication exponent and the $$\mathcal {\tilde{O}}(\cdot )$$ O ~ ( · ) notation suppresses polylog factors. In the k -EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most k errors. k -EDSM can be solved in $$\mathcal {O}(k^2mG+kN)$$ O ( k 2 m G + k N ) time, under edit distance, or $$\mathcal {O}(kmG+kN)$$ O ( k m G + k N ) time, under Hamming distance, where G denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately, G is only bounded by N , and so even for $$k=1$$ k = 1 , the existing algorithms run in $$\varOmega (mN)$$ Ω ( m N ) time in the worst case. In this paper we make progress in this direction. We show that 1-EDSM can be solved in $$\mathcal {O}((nm^2 + N)\log m)$$ O ( ( n m 2 + N ) log m ) or $$\mathcal {O}(nm^3 + N)$$ O ( n m 3 + N ) time under edit distance. For the decision version of the problem, we present a faster $$\mathcal {O}(nm^2\sqrt{\log m} + N\log \log m)$$ O ( n m 2 log m + N log log m ) -time algorithm. We also show that 1-EDSM can be solved in $$\mathcal {O}(nm^2 + N\log m)$$ O ( n m 2 + N log m ) time under Hamming distance. Our algorithms for edit distance rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or 2d range emptiness), which we show how to solve efficiently. In order to obtain an even faster algorithm for Hamming distance, we rely on employing and adapting the k -errata trees for indexing with errors [Cole et al., STOC 2004]. This is an extended version of a paper presented at LATIN 2022. |
| Author | Sweering, Michelle Pissis, Solon P. Gabory, Esteban Zuba, Wiktor Stougie, Leen Bernardini, Giulia |
| Author_xml | – sequence: 1 givenname: Giulia surname: Bernardini fullname: Bernardini, Giulia organization: University of Trieste – sequence: 2 givenname: Esteban surname: Gabory fullname: Gabory, Esteban organization: CWI – sequence: 3 givenname: Solon P. surname: Pissis fullname: Pissis, Solon P. email: solon.pissis@cwi.nl organization: CWI, Vrije Universiteit – sequence: 4 givenname: Leen surname: Stougie fullname: Stougie, Leen organization: CWI, Vrije Universiteit – sequence: 5 givenname: Michelle surname: Sweering fullname: Sweering, Michelle organization: CWI – sequence: 6 givenname: Wiktor surname: Zuba fullname: Zuba, Wiktor organization: CWI |
| BackLink | https://inria.hal.science/hal-04747458$$DView record in HAL |
| BookMark | eNp9kEFPwyAUx4mZidv0C3hq4skD-iiwQjwtczqTLR7UM2GUbixdO4Fp_Pa2q9HEwwKEF_j_Xl5-A9Sr6soidEnghgBktwEgTRmG5hAgkmFxgvqEUYqBSegd6hQzyuEMDULYAAAVAH10Ny11iM7ge7uylfU62uQleletkoWOZt0Wny6uE5JMva990uyFC9v27xydFroM9uLnHqK3h-nrZIbnz49Pk_EcGyqziAuWCckFlVTnQGGZUavzbLk0o9TkMhdWQCG5lqYJ6ZQzOeKEm4zlnOUst4QO0XXXd61LtfNuq_2XqrVTs_FctW_AsmZx8dFmr7rsztfvexui2tR7XzXjKUpSGAlJiWxSaZcyvg7B2-K3LQHVClWdUNUIVQehSjSQ-AcZF3V0dRW9duVxlHZo2LVqrf-b6gj1DXGviVs |
| CitedBy_id | crossref_primary_10_1109_ACCESS_2025_3597547 |
| Cites_doi | 10.1007/978-3-540-70575-8_8 10.1109/FOCS54457.2022.00072 10.1007/BFb0029792 10.1016/j.ic.2020.104616 10.4230/LIPIcs.CPM.2020.14 10.1017/CBO9780511546853 10.3233/FI-2020-1947 10.4230/LIPIcs.ISAAC.2021.20 10.1145/1007352.1007374 10.1145/3588334 10.1137/S0097539700370527 10.1006/jagm.2000.1104 10.1093/bioinformatics/btz162 10.4230/LIPIcs.WABI.2018.21 10.4230/LIPIcs.WABI.2020.7 10.1145/828.1884 10.1016/0022-0000(88)90045-1 10.1016/0304-3975(86)90178-7 10.1137/20M1368033 10.1016/j.tcs.2019.08.012 10.4230/LIPIcs.CPM.2017.9 10.1137/0217026 10.1007/978-3-031-20624-5_2 10.1016/0022-2836(71)90319-6 10.1109/FOCS46700.2020.00095 10.1007/978-3-030-20081-7_23 10.1016/S0304-3975(99)00333-3 10.1142/9789812797919_0002 10.1137/S0097539703435728 10.1089/cmb.2015.0235 10.1006/jagm.1999.1063 10.1016/0022-0000(83)90006-5 10.1093/bioinformatics/bty506 10.1016/S0304-3975(03)00053-7 10.1145/3385898 10.4230/LIPIcs.ICALP.2018.62 10.1145/1998196.1998198 10.1007/978-3-030-59212-7_6 10.1016/S0196-6774(03)00097-X 10.1007/10719839_9 10.1007/3-540-60044-2_51 10.4230/LIPIcs.ESA.2020.54 10.4230/LIPIcs.SEA.2018.16 10.4230/LIPIcs.CPM.2018.9 10.4230/LIPIcs.ICALP.2019.21 10.1109/SFCS.1997.646102 |
| ContentType | Journal Article |
| Copyright | The Author(s) 2024 The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. Attribution |
| Copyright_xml | – notice: The Author(s) 2024 – notice: The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: Attribution |
| DBID | C6C AAYXX CITATION 3V. 7SC 7WY 7WZ 7XB 87Z 88I 8AL 8AO 8FD 8FE 8FG 8FK 8FL ABJCF ABUWG AFKRA ARAPS AZQEC BENPR BEZIV BGLVJ CCPQU DWQXO FRNLG F~G GNUQQ HCIFZ JQ2 K60 K6~ K7- L.- L6V L7M L~C L~D M0C M0N M2P M7S P5Z P62 PHGZM PHGZT PKEHL PQBIZ PQBZA PQEST PQGLB PQQKQ PQUKI PRINS PTHSS PYYUZ Q9U 1XC VOOES |
| DOI | 10.1007/s00224-024-10194-8 |
| DatabaseName | Springer Nature Link CrossRef ProQuest Central (Corporate) Computer and Information Systems Abstracts ABI/INFORM Collection ABI/INFORM Global (PDF only) ProQuest Central (purchase pre-March 2016) ABI/INFORM Collection Science Database (Alumni Edition) Computing Database (Alumni Edition) ProQuest Pharma Collection Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) (purchase pre-March 2016) ABI/INFORM Collection (Alumni) Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland Advanced Technologies & Computer Science Collection ProQuest Central Essentials ProQuest Central Business Premium Collection Technology Collection ProQuest One ProQuest Central Business Premium Collection (Alumni) ABI/INFORM Global (Corporate) ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection ProQuest Business Collection (Alumni Edition) ProQuest Business Collection Computer Science Database ABI/INFORM Professional Advanced ProQuest Engineering Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional ABI/INFORM Global Computing Database Science Database Engineering Database Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic (New) ProQuest One Academic Middle East (New) ProQuest One Business ProQuest One Business (Alumni) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic (retired) ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection ABI/INFORM Collection China ProQuest Central Basic Hyper Article en Ligne (HAL) Hyper Article en Ligne (HAL) (Open Access) |
| DatabaseTitle | CrossRef ProQuest Business Collection (Alumni Edition) Computer Science Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts SciTech Premium Collection ProQuest Central China ABI/INFORM Complete ProQuest One Applied & Life Sciences ProQuest Central (New) Engineering Collection Advanced Technologies & Aerospace Collection Business Premium Collection ABI/INFORM Global Engineering Database ProQuest Science Journals (Alumni Edition) ProQuest One Academic Eastern Edition ProQuest Technology Collection ProQuest Business Collection ProQuest One Academic UKI Edition ProQuest One Academic ProQuest One Academic (New) ABI/INFORM Global (Corporate) ProQuest One Business Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest One Academic Middle East (New) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest Pharma Collection ProQuest Central ABI/INFORM Professional Advanced ProQuest Engineering Collection ProQuest Central Korea Advanced Technologies Database with Aerospace ABI/INFORM Complete (Alumni Edition) ProQuest Computing ABI/INFORM Global (Alumni Edition) ProQuest Central Basic ProQuest Science Journals ProQuest Computing (Alumni Edition) ABI/INFORM China ProQuest SciTech Collection Computer and Information Systems Abstracts Professional Advanced Technologies & Aerospace Database Materials Science & Engineering Collection ProQuest One Business (Alumni) ProQuest Central (Alumni) Business Premium Collection (Alumni) |
| DatabaseTitleList | ProQuest Business Collection (Alumni Edition) CrossRef |
| Database_xml | – sequence: 1 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Mathematics Computer Science |
| EISSN | 1433-0490 |
| EndPage | 1467 |
| ExternalDocumentID | oai:HAL:hal-04747458v1 10_1007_s00224_024_10194_8 |
| GrantInformation_xml | – fundername: Ministero dell'Università e della Ricerca grantid: MUR - FSE REACT EU - PON R&I 2014-2020 funderid: http://dx.doi.org/10.13039/501100021856 – fundername: European Commission grantid: 956229; 872539; 872539 funderid: http://dx.doi.org/10.13039/501100000780 – fundername: Nederlandse Organisatie voor Wetenschappelijk Onderzoek grantid: NETWORKS-024.002.003; NETWORKS-024.002.003; NETWORKS-024.002.003 funderid: http://dx.doi.org/10.13039/501100003246 – fundername: Nederlandse Organisatie voor Toegepast Natuurwetenschappelijk Onderzoek grantid: OCENW.GROOT.2019.015 funderid: http://dx.doi.org/10.13039/501100019926 |
| GroupedDBID | --Z -59 -5G -BR -EM -Y2 -~C -~X .4S .86 .DC .VR 06D 0R~ 0VY 123 1N0 2.D 203 29Q 2J2 2JN 2JY 2KG 2LR 2P1 2VQ 2~H 30V 3V. 4.4 406 408 409 40D 40E 5QI 5VS 67Z 6NX 7WY 88I 8AO 8FE 8FG 8FL 8TC 8UJ 8V8 95- 95. 95~ 96X AAAVM AABHQ AACDK AAHNG AAIAL AAJBT AAJKR AANZL AAOBN AARHV AARTL AASML AATNV AATVU AAUYE AAWCG AAYIU AAYQN AAYTO AAYZH ABAKF ABBBX ABBXA ABDBF ABDZT ABECU ABFTD ABFTV ABHLI ABHQN ABJCF ABJNI ABJOX ABKCH ABKTR ABLJU ABMNI ABMQK ABNWP ABQBU ABQSL ABSXP ABTEG ABTHY ABTKH ABTMW ABULA ABUWG ABWNU ABXPI ACAOD ACBXY ACDTI ACGFS ACGOD ACHSB ACHXU ACIWK ACKNC ACMDZ ACMLO ACOKC ACOMO ACPIV ACUHS ACZOJ ADHIR ADIMF ADINQ ADKNI ADKPE ADMLS ADPHR ADRFC ADTPH ADURQ ADYFF ADZKW AEBTG AEFIE AEFQL AEGAL AEGNC AEJHL AEJRE AEKMD AEMOZ AEMSY AENEX AEOHA AEPYU AESKC AETLH AEVLU AEXYK AFBBN AFEXP AFGCZ AFKRA AFLOW AFQWF AFWTZ AFZKB AGAYW AGDGC AGGDS AGJBK AGMZJ AGQEE AGQMX AGWIL AGWZB AGYKE AHAVH AHBYD AHQJS AHSBF AHYZX AIAKS AIGIU AIIXL AILAN AITGF AJBLW AJRNO AJZVZ AKVCP ALMA_UNASSIGNED_HOLDINGS ALWAN AMKLP AMXSW AMYLF AOCGG ARAPS ARCSS ARMRJ AXYYD AYJHY AZFZN AZQEC B-. B0M BA0 BDATZ BENPR BEZIV BGLVJ BGNMA BPHCQ BSONS C6C CAG CCPQU COF CS3 CSCUP D0L DDRTE DL5 DNIVK DPUIP DU5 DWQXO EAD EAP EBA EBLON EBR EBS EBU ECS EDO EIOEI EJD EMK EPL ESBYG EST ESX FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRNLG FRRFC FSGXE FWDCC GGCAI GGRSB GJIRD GNUQQ GNWQR GQ3 GQ6 GQ7 GQ8 GROUPED_ABI_INFORM_COMPLETE GXS H13 HCIFZ HF~ HG5 HG6 HMJXF HQYDN HRMNR HVGLF HZ~ H~9 I-F I09 IHE IJ- IKXTQ ITM IWAJR IXC IZIGR IZQ I~X I~Z J-C J0Z JBSCW JCJTX JZLTJ K1G K60 K6V K6~ K7- KDC KOV L6V LAS LLZTM M0C M0N M2P M4Y M7S MA- MK~ ML~ NB0 NDZJH NPVJJ NQJWS NU0 O9- O93 O9G O9I O9J OAM P19 P2P P62 P9O PF- PQBIZ PQBZA PQQKQ PROAC PT4 PT5 PTHSS Q2X QOK QOS R89 R9I RNI ROL RPX RSV RZK S16 S1Z S26 S27 S28 S3B SAP SCLPG SCO SDH SDM SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 T16 TH9 TSG TSK TSV TUC TUS U2A UG4 UOJIU UTJUX UZXMN VC2 VFIZW W23 W48 WH7 WK8 XOL YLTOR Z45 Z7X Z83 Z88 Z8R Z8W Z92 ZMTXR ~8M AAPKM AAYXX ABBRH ABDBE ABFSG ABRTQ ACSTC ADHKG AEZWR AFDZB AFFHD AFHIU AFOHR AGQPQ AHPBZ AHWEU AIXLP AMVHM ATHPR AYFIA CITATION PHGZM PHGZT PQGLB 7SC 7XB 8AL 8FD 8FK JQ2 L.- L7M L~C L~D PKEHL PQEST PQUKI PRINS Q9U 1XC VOOES |
| ID | FETCH-LOGICAL-c397t-f478958393ad030b73ead7bbc62cd9d8e80f95a9c958a25496515c74d54d4de13 |
| IEDL.DBID | K7- |
| ISICitedReferencesCount | 3 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001321482600001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1432-4350 |
| IngestDate | Tue Nov 11 03:01:03 EST 2025 Wed Nov 05 09:31:29 EST 2025 Sat Nov 29 03:28:56 EST 2025 Tue Nov 18 21:01:40 EST 2025 Fri Feb 21 02:37:22 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 5 |
| Keywords | String algorithms Elastic-degenerate strings Edit distance Hamming distance Approximate string matching |
| Language | English |
| License | Attribution: http://creativecommons.org/licenses/by |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c397t-f478958393ad030b73ead7bbc62cd9d8e80f95a9c958a25496515c74d54d4de13 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ORCID | 0000-0003-1200-6015 |
| OpenAccessLink | https://link.springer.com/10.1007/s00224-024-10194-8 |
| PQID | 3120689319 |
| PQPubID | 48907 |
| PageCount | 26 |
| ParticipantIDs | hal_primary_oai_HAL_hal_04747458v1 proquest_journals_3120689319 crossref_primary_10_1007_s00224_024_10194_8 crossref_citationtrail_10_1007_s00224_024_10194_8 springer_journals_10_1007_s00224_024_10194_8 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-10-01 |
| PublicationDateYYYYMMDD | 2024-10-01 |
| PublicationDate_xml | – month: 10 year: 2024 text: 2024-10-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York |
| PublicationTitle | Theory of computing systems |
| PublicationTitleAbbrev | Theory Comput Syst |
| PublicationYear | 2024 |
| Publisher | Springer US Springer Nature B.V Springer Verlag |
| Publisher_xml | – name: Springer US – name: Springer Nature B.V – name: Springer Verlag |
| References | Bernardini, G., Gawrychowski, P., Pisanti, N., Pissis, S.P., Rosone, G.: Even faster elastic-degenerate string matching via fast matrix multiplication. In: Baier, C., Chatzigiannakis, I., Flocchini, P., Leonardi, S. (eds.) 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, LIPIcs, vol. 132, pp. 21:1–21:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2019). https://doi.org/10.4230/LIPIcs.ICALP.2019.21 ChazelleBA functional approach to data structures and its use in multidimensional searchingSIAM J. Comput.198817342746294193710.1137/0217026 Cole, R., Gottlieb, L., Lewenstein, M.: Dictionary matching and indexing with errors and don’t cares. In: Babai, L. (ed.) Proceedings of the 36th Annual ACM Symposium on Theory of Computing, Chicago, IL, USA, June 13-16, 2004, pp. 91–100. ACM (2004). https://doi.org/10.1145/1007352.1007374 AmirALewensteinMPoratEFaster algorithms for string matching with k mismatchesJ. Algorithms2004502257275205302010.1016/S0196-6774(03)00097-X ShiQJáJáJFNovel transformation techniques using q-heaps with applications to computational geometrySIAM J. Comput.200534614741492216575110.1137/S0097539703435728 AlzamelMAyadLAKBernardiniGGrossiRIliopoulosCSPisantiNPissisSPRosoneGComparing degenerate strings. Fundam. Informaticae20201751–4415810.3233/FI-2020-1947 Manber, U., Wu, S.: Approximate string matching with arbitrary costs for text and hypertext, pp. 22–33. https://doi.org/10.1142/9789812797919_0002. https://www.worldscientific.com/doi/abs/10.1142/9789812797919_0002 RautiainenMMäkinenVMarschallTBit-parallel sequence-to-graph alignment. Bioinform.201935193599360710.1093/bioinformatics/btz162 AmirAKeselmanDLandauGMLewensteinMLewensteinNRodehMText indexing and dictionary matching with one errorJ. Algorithms2000372309325178883810.1006/jagm.2000.1104 Chan, T.M., Larsen, K.G., Patrascu, M.: Orthogonal range searching on the RAM, revisited. In: Hurtado, F., van Kreveld, M.J., (eds.) Proceedings of the 27th ACM Symposium on Computational Geometry, Paris, France, June 13-15, 2011, pp. 1–10. ACM (2011). https://doi.org/10.1145/1998196.1998198 Charalampopoulos, P., Kociumaka, T., Wellnitz, P.: Faster pattern matching under edit distance : A reduction to dynamic puzzle matching and the seaweed monoid of permutation matrices. In: 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, Denver, CO, USA, October 31 - November 3, 2022, pp. 698–707. IEEE (2022). https://doi.org/10.1109/FOCS54457.2022.00072 Gao, Y., He, M., Nekrich, Y.: Fast preprocessing for optimal orthogonal range reporting and range successor with applications to text indexing. In: Grandoni, F., Herman, G., Sanders, P. (eds.) 28th Annual European Symposium on Algorithms, ESA 2020, September 7-9, 2020, Pisa, Italy (Virtual Conference), LIPIcs, vol. 173, pp. 54:1–54:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2020). https://doi.org/10.4230/LIPIcs.ESA.2020.54 LandauGMVishkinUFast string matching with k differencesJ. Comput. Syst. Sci.1988371637897365710.1016/0022-0000(88)90045-1 Grossi, R., Iliopoulos, C.S., Liu, C., Pisanti, N., Pissis, S.P., Retha, A., Rosone, G., Vayani, F., Versari, L.: On-line pattern matching on similar texts. In: Kärkkäinen, J., Radoszewski, J., Rytter, W. (eds.) 28th Annual Symposium on Combinatorial Pattern Matching, CPM 2017, July 4-6, 2017, Warsaw, Poland, LIPIcs, vol. 78, pp. 9:1–9:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2017). https://doi.org/10.4230/LIPIcs.CPM.2017.9 Sleator, D.D., Tarjan, R.E.: A data structure for dynamic trees. J. Comput. Syst. Sci. 26(3), 362–391 (1983). https://doi.org/10.1016/0022-0000(83)90006-5. https://www.sciencedirect.com/science/article/pii/0022000083900065 Ruzic, M.: Constructing efficient dictionaries in close to sorting time. In: Aceto, L., Damgård, I., Goldberg, L.A., Halldórsson, M.M., Ingólfsdóttir, A., Walukiewicz, I. (eds.) Automata, Languages and Programming, 35th International Colloquium, ICALP 2008, Reykjavik, Iceland, July 7-11, 2008, Proceedings, Part I: Tack A: Algorithms, Automata, Complexity, and Games, Lecture Notes in Computer Science, vol. 5125, pp. 84–95. Springer (2008). https://doi.org/10.1007/978-3-540-70575-8_8 BernardiniGGawrychowskiPPisantiNPissisSPRosoneGElastic-degenerate string matching via fast matrix multiplicationSIAM J. Comput.2022513549576442371410.1137/20M1368033 CharalampopoulosPIliopoulosCSLiuCPissisSPProperty suffix array with applications in indexing weighted sequencesACM J. Exp. Algorithmics202025116410286110.1145/3385898 Pissis, S.P., Retha, A.: Dictionary matching in elastic-degenerate texts with applications in searching VCF files on-line. In: D’Angelo, G. (ed.) 17th International Symposium on Experimental Algorithms, SEA 2018, June 27-29, 2018, L’Aquila, Italy, LIPIcs, vol. 103, pp. 16:1–16:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.SEA.2018.16 Gawrychowski, P., Uznanski, P.: Towards unified approximate pattern matching for Hamming and l_1 distance. In: Chatzigiannakis, I., Kaklamanis, C., Marx, D., Sannella, D. (eds.) 45th International Colloquium on Automata, Languages, and Programming, ICALP 2018, July 9-13, 2018, Prague, Czech Republic, LIPIcs, vol. 107, pp. 62:1–62:13. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.ICALP.2018.62 Charalampopoulos, P., Kociumaka, T., Wellnitz, P.: Faster approximate pattern matching: A unified approach. In: Irani, S. (ed.) 61st IEEE Annual Symposium on Foundations of Computer Science, FOCS 2020, Durham, NC, USA, November 16-19, 2020, pp. 978–989. IEEE (2020). https://doi.org/10.1109/FOCS46700.2020.00095 ColeRHariharanRApproximate string matching: A simpler faster algorithmSIAM J. Comput.200231617611782195487710.1137/S0097539700370527 CislakAGrabowskiSHolubJSOPanG: online text searching over a pan-genomeBioinform.201834244290429210.1093/bioinformatics/bty506 LandauGMVishkinUEfficient string matching with k mismatchesTheor. Comput. Sci.19864323924985597410.1016/0304-3975(86)90178-7 Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on strings. Cambridge University Press (2007) Equi, M., Norri, T., Alanko, J., Cazaux, B., Tomescu, A.I., Mäkinen, V.: Algorithms and complexity on indexing elastic founder graphs. In: Ahn, H., Sadakane, K. (eds.) 32nd International Symposium on Algorithms and Computation, ISAAC 2021, December 6-8, 2021, Fukuoka, Japan, LIPIcs, vol. 212, pp. 20:1–20:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2021). https://doi.org/10.4230/LIPIcs.ISAAC.2021.20 BernardiniGGaboryEPissisSPStougieLSweeringMZubaWCastañedaARodríguez-HenríquezFElastic-degenerate string matching with 1 errorLATIN 2022: Theoretical Informatics2022ChamSpringer International Publishing203710.1007/978-3-031-20624-5_2 Bender, M.A., Farach-Colton, M.: The LCA problem revisited. In: Gonnet, G.H., Panario, D., Viola, A. (eds.) LATIN 2000: Theoretical Informatics, 4th Latin American Symposium, Punta del Este, Uruguay, April 10-14, 2000, Proceedings, Lecture Notes in Computer Science, vol. 1776, pp. 88–94. Springer (2000). https://doi.org/10.1007/10719839_9 Na, J.C., Apostolico, A., Iliopoulos, C.S., Park, K.: Truncated suffix trees and their application to data compression. Theor. Comput. Sci. 304(1-3), 87–101 (2003). https://doi.org/10.1016/S0304-3975(03)00053-7 Farach, M.: Optimal suffix tree construction with large alphabets. In: 38th Annual Symposium On Foundations Of Computer Science, FOCS ’97, Miami Beach, Florida, USA, October 19-22, 1997, pp. 137–143. IEEE Computer Society (1997). https://doi.org/10.1109/SFCS.1997.646102 Akutsu, T.: A linear time pattern matching algorithm between a string and a tree. In: Apostolico, A., Crochemore, M., Galil, Z., Manber, U. (eds.) Combinatorial Pattern Matching, 4th Annual Symposium, CPM 93, Padova, Italy, June 2-4, 1993, Proceedings, Lecture Notes in Computer Science, vol. 684, pp. 1–10. Springer (1993). https://doi.org/10.1007/BFb0029792 IUPAC-IUB Commission on Biochemical NomenclatureAbbreviations and symbols for nucleic acids, polynucleotides, and their constituentsBiochemistry19709204022402710.1016/0022-2836(71)90319-6 Carletti, V., Foggia, P., Garrison, E., Greco, L., Ritrovato, P., Vento, M.: Graph-based representations for supporting genome data analysis and visualization: Opportunities and challenges. In: Conte, D., Ramel, J., Foggia, P. (eds.) Graph-Based Representations in Pattern Recognition - 12th IAPR-TC-15 International Workshop, GbRPR 2019, Tours, France, June 19-21, 2019, Proceedings, Lecture Notes in Computer Science, vol. 11510, pp. 237–246. Springer (2019). https://doi.org/10.1007/978-3-030-20081-7_23 NavarroGImproved approximate pattern matching on hypertextTheor. Comput. Sci.20002371–2455463175622310.1016/S0304-3975(99)00333-3 The Computational Pan-Genomics ConsortiumComputational pan-genomics: status, promises and challengesBriefings Bioinforma2018191118135 Alzamel, M., Ayad, L.A.K., Bernardini, G., Grossi, R., Iliopoulos, C.S., Pisanti, N., Pissis, S.P., Rosone, G.: Degenerate string comparison and applications. In: Parida, L., Ukkonen, E. (eds.) 18th International Workshop on Algorithms in Bioinformatics, WABI 2018, August 20-22, 2018, Helsinki, Finland, LIPIcs, vol. 113, pp. 21:1–21:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.WABI.2018.21 Aoyama, K., Nakashima, Y., I, T., Inenaga, S., Bannai, H., Takeda, M.: Faster online elastic degenerate string matching. In: Navarro, G., Sankoff, D., Zhu, B. (eds.) Annual Symposium on Combinatorial Pattern Matching, CPM 2018, July 2-4, 2018 - Qingdao, China, LIPIcs, vol. 105, pp. 9:1–9:10. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.CPM.2018.9 AmirALewensteinMLewensteinNPattern matching in hypertextJ. Algorithms20003518299174771910.1006/jagm.1999.1063 BernardiniGPisantiNPissisSPRosoneGApproximate pattern matching on elastic-degenerate textTheor. Comput. Sci 10194_CR1 ML Fredman (10194_CR26) 1984; 31 10194_CR2 The Computational Pan-Genomics Consortium (10194_CR47) 2018; 19 GM Landau (10194_CR34) 1986; 43 R Cole (10194_CR21) 2002; 31 10194_CR7 10194_CR8 10194_CR41 10194_CR20 G Navarro (10194_CR39) 2000; 237 10194_CR40 A Amir (10194_CR5) 2000; 35 10194_CR16 10194_CR38 10194_CR17 10194_CR14 10194_CR36 10194_CR37 Q Shi (10194_CR44) 2005; 34 10194_CR13 10194_CR10 10194_CR32 G Bernardini (10194_CR11) 2022; 51 B Chazelle (10194_CR18) 1988; 17 A Cislak (10194_CR19) 2018; 34 P Charalampopoulos (10194_CR15) 2020; 25 M Alzamel (10194_CR3) 2020; 175 A Amir (10194_CR4) 2000; 37 A Amir (10194_CR6) 2004; 50 G Bernardini (10194_CR12) 2020; 812 10194_CR30 10194_CR31 GM Landau (10194_CR35) 1988; 37 G Bernardini (10194_CR9) 2022 M Rautiainen (10194_CR42) 2019; 35 10194_CR27 10194_CR28 10194_CR25 IUPAC-IUB Commission on Biochemical Nomenclature (10194_CR33) 1970; 9 10194_CR23 10194_CR45 10194_CR24 10194_CR46 10194_CR43 10194_CR22 10194_CR29 |
| References_xml | – reference: Aoyama, K., Nakashima, Y., I, T., Inenaga, S., Bannai, H., Takeda, M.: Faster online elastic degenerate string matching. In: Navarro, G., Sankoff, D., Zhu, B. (eds.) Annual Symposium on Combinatorial Pattern Matching, CPM 2018, July 2-4, 2018 - Qingdao, China, LIPIcs, vol. 105, pp. 9:1–9:10. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.CPM.2018.9 – reference: NavarroGImproved approximate pattern matching on hypertextTheor. Comput. Sci.20002371–2455463175622310.1016/S0304-3975(99)00333-3 – reference: Charalampopoulos, P., Kociumaka, T., Wellnitz, P.: Faster approximate pattern matching: A unified approach. In: Irani, S. (ed.) 61st IEEE Annual Symposium on Foundations of Computer Science, FOCS 2020, Durham, NC, USA, November 16-19, 2020, pp. 978–989. IEEE (2020). https://doi.org/10.1109/FOCS46700.2020.00095 – reference: Farach, M.: Optimal suffix tree construction with large alphabets. In: 38th Annual Symposium On Foundations Of Computer Science, FOCS ’97, Miami Beach, Florida, USA, October 19-22, 1997, pp. 137–143. IEEE Computer Society (1997). https://doi.org/10.1109/SFCS.1997.646102 – reference: Sleator, D.D., Tarjan, R.E.: A data structure for dynamic trees. J. Comput. Syst. Sci. 26(3), 362–391 (1983). https://doi.org/10.1016/0022-0000(83)90006-5. https://www.sciencedirect.com/science/article/pii/0022000083900065 – reference: Carletti, V., Foggia, P., Garrison, E., Greco, L., Ritrovato, P., Vento, M.: Graph-based representations for supporting genome data analysis and visualization: Opportunities and challenges. In: Conte, D., Ramel, J., Foggia, P. (eds.) Graph-Based Representations in Pattern Recognition - 12th IAPR-TC-15 International Workshop, GbRPR 2019, Tours, France, June 19-21, 2019, Proceedings, Lecture Notes in Computer Science, vol. 11510, pp. 237–246. Springer (2019). https://doi.org/10.1007/978-3-030-20081-7_23 – reference: The Computational Pan-Genomics ConsortiumComputational pan-genomics: status, promises and challengesBriefings Bioinforma2018191118135 – reference: AlzamelMAyadLAKBernardiniGGrossiRIliopoulosCSPisantiNPissisSPRosoneGComparing degenerate strings. Fundam. Informaticae20201751–4415810.3233/FI-2020-1947 – reference: Gawrychowski, P., Uznanski, P.: Towards unified approximate pattern matching for Hamming and l_1 distance. In: Chatzigiannakis, I., Kaklamanis, C., Marx, D., Sannella, D. (eds.) 45th International Colloquium on Automata, Languages, and Programming, ICALP 2018, July 9-13, 2018, Prague, Czech Republic, LIPIcs, vol. 107, pp. 62:1–62:13. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.ICALP.2018.62 – reference: RautiainenMMäkinenVMarschallTBit-parallel sequence-to-graph alignment. Bioinform.201935193599360710.1093/bioinformatics/btz162 – reference: AmirALewensteinMLewensteinNPattern matching in hypertextJ. Algorithms20003518299174771910.1006/jagm.1999.1063 – reference: LandauGMVishkinUEfficient string matching with k mismatchesTheor. Comput. Sci.19864323924985597410.1016/0304-3975(86)90178-7 – reference: CharalampopoulosPIliopoulosCSLiuCPissisSPProperty suffix array with applications in indexing weighted sequencesACM J. Exp. Algorithmics202025116410286110.1145/3385898 – reference: Gibney, D.: An efficient elastic-degenerate text index? not likely. In: Boucher, C., Thankachan, S.V. (eds.) String Processing and Information Retrieval - 27th International Symposium, SPIRE 2020, Orlando, FL, USA, October 13-15, 2020, Proceedings, Lecture Notes in Computer Science, vol. 12303, pp. 76–88. Springer (2020). https://doi.org/10.1007/978-3-030-59212-7_6 – reference: Grossi, R., Iliopoulos, C.S., Liu, C., Pisanti, N., Pissis, S.P., Retha, A., Rosone, G., Vayani, F., Versari, L.: On-line pattern matching on similar texts. In: Kärkkäinen, J., Radoszewski, J., Rytter, W. (eds.) 28th Annual Symposium on Combinatorial Pattern Matching, CPM 2017, July 4-6, 2017, Warsaw, Poland, LIPIcs, vol. 78, pp. 9:1–9:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2017). https://doi.org/10.4230/LIPIcs.CPM.2017.9 – reference: ShiQJáJáJFNovel transformation techniques using q-heaps with applications to computational geometrySIAM J. Comput.200534614741492216575110.1137/S0097539703435728 – reference: Akutsu, T.: A linear time pattern matching algorithm between a string and a tree. In: Apostolico, A., Crochemore, M., Galil, Z., Manber, U. (eds.) Combinatorial Pattern Matching, 4th Annual Symposium, CPM 93, Padova, Italy, June 2-4, 1993, Proceedings, Lecture Notes in Computer Science, vol. 684, pp. 1–10. Springer (1993). https://doi.org/10.1007/BFb0029792 – reference: Na, J.C., Apostolico, A., Iliopoulos, C.S., Park, K.: Truncated suffix trees and their application to data compression. Theor. Comput. Sci. 304(1-3), 87–101 (2003). https://doi.org/10.1016/S0304-3975(03)00053-7 – reference: BernardiniGGaboryEPissisSPStougieLSweeringMZubaWCastañedaARodríguez-HenríquezFElastic-degenerate string matching with 1 errorLATIN 2022: Theoretical Informatics2022ChamSpringer International Publishing203710.1007/978-3-031-20624-5_2 – reference: Bender, M.A., Farach-Colton, M.: The LCA problem revisited. In: Gonnet, G.H., Panario, D., Viola, A. (eds.) LATIN 2000: Theoretical Informatics, 4th Latin American Symposium, Punta del Este, Uruguay, April 10-14, 2000, Proceedings, Lecture Notes in Computer Science, vol. 1776, pp. 88–94. Springer (2000). https://doi.org/10.1007/10719839_9 – reference: Equi, M., Norri, T., Alanko, J., Cazaux, B., Tomescu, A.I., Mäkinen, V.: Algorithms and complexity on indexing elastic founder graphs. In: Ahn, H., Sadakane, K. (eds.) 32nd International Symposium on Algorithms and Computation, ISAAC 2021, December 6-8, 2021, Fukuoka, Japan, LIPIcs, vol. 212, pp. 20:1–20:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2021). https://doi.org/10.4230/LIPIcs.ISAAC.2021.20 – reference: ChazelleBA functional approach to data structures and its use in multidimensional searchingSIAM J. Comput.198817342746294193710.1137/0217026 – reference: ColeRHariharanRApproximate string matching: A simpler faster algorithmSIAM J. Comput.200231617611782195487710.1137/S0097539700370527 – reference: Equi, M., Mäkinen, V., Tomescu, A.I., Grossi, R.: On the complexity of string matching for graphs. ACM Trans. Algorithms 19(3), 21:1–21:25 (2023). https://doi.org/10.1145/3588334 – reference: AmirALewensteinMPoratEFaster algorithms for string matching with k mismatchesJ. Algorithms2004502257275205302010.1016/S0196-6774(03)00097-X – reference: Pissis, S.P., Retha, A.: Dictionary matching in elastic-degenerate texts with applications in searching VCF files on-line. In: D’Angelo, G. (ed.) 17th International Symposium on Experimental Algorithms, SEA 2018, June 27-29, 2018, L’Aquila, Italy, LIPIcs, vol. 103, pp. 16:1–16:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.SEA.2018.16 – reference: Charalampopoulos, P., Kociumaka, T., Wellnitz, P.: Faster pattern matching under edit distance : A reduction to dynamic puzzle matching and the seaweed monoid of permutation matrices. In: 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, Denver, CO, USA, October 31 - November 3, 2022, pp. 698–707. IEEE (2022). https://doi.org/10.1109/FOCS54457.2022.00072 – reference: Chan, T.M., Larsen, K.G., Patrascu, M.: Orthogonal range searching on the RAM, revisited. In: Hurtado, F., van Kreveld, M.J., (eds.) Proceedings of the 27th ACM Symposium on Computational Geometry, Paris, France, June 13-15, 2011, pp. 1–10. ACM (2011). https://doi.org/10.1145/1998196.1998198 – reference: Gao, Y., He, M., Nekrich, Y.: Fast preprocessing for optimal orthogonal range reporting and range successor with applications to text indexing. In: Grandoni, F., Herman, G., Sanders, P. (eds.) 28th Annual European Symposium on Algorithms, ESA 2020, September 7-9, 2020, Pisa, Italy (Virtual Conference), LIPIcs, vol. 173, pp. 54:1–54:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2020). https://doi.org/10.4230/LIPIcs.ESA.2020.54 – reference: BernardiniGPisantiNPissisSPRosoneGApproximate pattern matching on elastic-degenerate textTheor. Comput. Sci.2020812109122406677910.1016/j.tcs.2019.08.012 – reference: Mäkinen, V., Cazaux, B., Equi, M., Norri, T., Tomescu, A.I.: Linear time construction of indexable founder block graphs. In: Kingsford, C., Pisanti, N. (eds.) 20th International Workshop on Algorithms in Bioinformatics, WABI 2020, September 7-9, 2020, Pisa, Italy (Virtual Conference), LIPIcs, vol. 172, pp. 7:1–7:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2020). https://doi.org/10.4230/LIPIcs.WABI.2020.7 – reference: Iliopoulos, C.S., Kundu, R., Pissis, S.P.: Efficient pattern matching in elastic-degenerate strings. Inf. Comput. 279, 104,616 (2021). https://doi.org/10.1016/j.ic.2020.104616 – reference: Cole, R., Gottlieb, L., Lewenstein, M.: Dictionary matching and indexing with errors and don’t cares. In: Babai, L. (ed.) Proceedings of the 36th Annual ACM Symposium on Theory of Computing, Chicago, IL, USA, June 13-16, 2004, pp. 91–100. ACM (2004). https://doi.org/10.1145/1007352.1007374 – reference: IUPAC-IUB Commission on Biochemical NomenclatureAbbreviations and symbols for nucleic acids, polynucleotides, and their constituentsBiochemistry19709204022402710.1016/0022-2836(71)90319-6 – reference: LandauGMVishkinUFast string matching with k differencesJ. Comput. Syst. Sci.1988371637897365710.1016/0022-0000(88)90045-1 – reference: Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on strings. Cambridge University Press (2007) – reference: Manber, U., Wu, S.: Approximate string matching with arbitrary costs for text and hypertext, pp. 22–33. https://doi.org/10.1142/9789812797919_0002. https://www.worldscientific.com/doi/abs/10.1142/9789812797919_0002 – reference: Alzamel, M., Ayad, L.A.K., Bernardini, G., Grossi, R., Iliopoulos, C.S., Pisanti, N., Pissis, S.P., Rosone, G.: Degenerate string comparison and applications. In: Parida, L., Ukkonen, E. (eds.) 18th International Workshop on Algorithms in Bioinformatics, WABI 2018, August 20-22, 2018, Helsinki, Finland, LIPIcs, vol. 113, pp. 21:1–21:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2018). https://doi.org/10.4230/LIPIcs.WABI.2018.21 – reference: CislakAGrabowskiSHolubJSOPanG: online text searching over a pan-genomeBioinform.201834244290429210.1093/bioinformatics/bty506 – reference: Ruzic, M.: Constructing efficient dictionaries in close to sorting time. In: Aceto, L., Damgård, I., Goldberg, L.A., Halldórsson, M.M., Ingólfsdóttir, A., Walukiewicz, I. (eds.) Automata, Languages and Programming, 35th International Colloquium, ICALP 2008, Reykjavik, Iceland, July 7-11, 2008, Proceedings, Part I: Tack A: Algorithms, Automata, Complexity, and Games, Lecture Notes in Computer Science, vol. 5125, pp. 84–95. Springer (2008). https://doi.org/10.1007/978-3-540-70575-8_8 – reference: Park, K., Kim, D.K.: String matching in hypertext. In: Galil, Z., Ukkonen, E., (eds.) Combinatorial Pattern Matching, 6th Annual Symposium, CPM 95, Espoo, Finland, July 5-7, 1995, Proceedings, Lecture Notes in Computer Science, vol. 937, pp. 318–329. Springer (1995). https://doi.org/10.1007/3-540-60044-2_51 – reference: BernardiniGGawrychowskiPPisantiNPissisSPRosoneGElastic-degenerate string matching via fast matrix multiplicationSIAM J. Comput.2022513549576442371410.1137/20M1368033 – reference: Bernardini, G., Gawrychowski, P., Pisanti, N., Pissis, S.P., Rosone, G.: Even faster elastic-degenerate string matching via fast matrix multiplication. In: Baier, C., Chatzigiannakis, I., Flocchini, P., Leonardi, S. (eds.) 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, LIPIcs, vol. 132, pp. 21:1–21:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2019). https://doi.org/10.4230/LIPIcs.ICALP.2019.21 – reference: FredmanMLKomlósJSzemerédiEStoring a sparse table with 0(1) worst case access timeJ. ACM198431353854481915610.1145/828.1884 – reference: Gawrychowski, P., Ghazawi, S., Landau, G.M.: On indeterminate strings matching. In: Gørtz, I.L., Weimann, O. (eds.) 31st Annual Symposium on Combinatorial Pattern Matching, CPM 2020, June 17-19, 2020, Copenhagen, Denmark, LIPIcs, vol. 161, pp. 14:1–14:14. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2020). https://doi.org/10.4230/LIPIcs.CPM.2020.14 – reference: Thankachan, S.V., Apostolico, A., Aluru, S.: A provably efficient algorithm for the k-mismatch average common substring problem. J. Comput. Biol. 23(6), 472–482 (2016). https://doi.org/10.1089/cmb.2015.0235. http://www.liebertpub.com/doi/10.1089/cmb.2015.0235 – reference: AmirAKeselmanDLandauGMLewensteinMLewensteinNRodehMText indexing and dictionary matching with one errorJ. Algorithms2000372309325178883810.1006/jagm.2000.1104 – ident: 10194_CR43 doi: 10.1007/978-3-540-70575-8_8 – ident: 10194_CR17 doi: 10.1109/FOCS54457.2022.00072 – ident: 10194_CR1 doi: 10.1007/BFb0029792 – ident: 10194_CR32 doi: 10.1016/j.ic.2020.104616 – ident: 10194_CR28 doi: 10.4230/LIPIcs.CPM.2020.14 – ident: 10194_CR22 doi: 10.1017/CBO9780511546853 – volume: 175 start-page: 41 issue: 1–4 year: 2020 ident: 10194_CR3 publication-title: Comparing degenerate strings. Fundam. Informaticae doi: 10.3233/FI-2020-1947 – ident: 10194_CR24 doi: 10.4230/LIPIcs.ISAAC.2021.20 – ident: 10194_CR20 doi: 10.1145/1007352.1007374 – volume: 19 start-page: 118 issue: 1 year: 2018 ident: 10194_CR47 publication-title: Briefings Bioinforma – ident: 10194_CR23 doi: 10.1145/3588334 – volume: 31 start-page: 1761 issue: 6 year: 2002 ident: 10194_CR21 publication-title: SIAM J. Comput. doi: 10.1137/S0097539700370527 – volume: 37 start-page: 309 issue: 2 year: 2000 ident: 10194_CR4 publication-title: J. Algorithms doi: 10.1006/jagm.2000.1104 – volume: 35 start-page: 3599 issue: 19 year: 2019 ident: 10194_CR42 publication-title: Bit-parallel sequence-to-graph alignment. Bioinform. doi: 10.1093/bioinformatics/btz162 – ident: 10194_CR2 doi: 10.4230/LIPIcs.WABI.2018.21 – ident: 10194_CR36 doi: 10.4230/LIPIcs.WABI.2020.7 – volume: 31 start-page: 538 issue: 3 year: 1984 ident: 10194_CR26 publication-title: J. ACM doi: 10.1145/828.1884 – volume: 37 start-page: 63 issue: 1 year: 1988 ident: 10194_CR35 publication-title: J. Comput. Syst. Sci. doi: 10.1016/0022-0000(88)90045-1 – volume: 43 start-page: 239 year: 1986 ident: 10194_CR34 publication-title: Theor. Comput. Sci. doi: 10.1016/0304-3975(86)90178-7 – volume: 51 start-page: 549 issue: 3 year: 2022 ident: 10194_CR11 publication-title: SIAM J. Comput. doi: 10.1137/20M1368033 – volume: 812 start-page: 109 year: 2020 ident: 10194_CR12 publication-title: Theor. Comput. Sci. doi: 10.1016/j.tcs.2019.08.012 – ident: 10194_CR31 doi: 10.4230/LIPIcs.CPM.2017.9 – volume: 17 start-page: 427 issue: 3 year: 1988 ident: 10194_CR18 publication-title: SIAM J. Comput. doi: 10.1137/0217026 – start-page: 20 volume-title: LATIN 2022: Theoretical Informatics year: 2022 ident: 10194_CR9 doi: 10.1007/978-3-031-20624-5_2 – volume: 9 start-page: 4022 issue: 20 year: 1970 ident: 10194_CR33 publication-title: Biochemistry doi: 10.1016/0022-2836(71)90319-6 – ident: 10194_CR16 doi: 10.1109/FOCS46700.2020.00095 – ident: 10194_CR13 doi: 10.1007/978-3-030-20081-7_23 – volume: 237 start-page: 455 issue: 1–2 year: 2000 ident: 10194_CR39 publication-title: Theor. Comput. Sci. doi: 10.1016/S0304-3975(99)00333-3 – ident: 10194_CR37 doi: 10.1142/9789812797919_0002 – volume: 34 start-page: 1474 issue: 6 year: 2005 ident: 10194_CR44 publication-title: SIAM J. Comput. doi: 10.1137/S0097539703435728 – ident: 10194_CR46 doi: 10.1089/cmb.2015.0235 – volume: 35 start-page: 82 issue: 1 year: 2000 ident: 10194_CR5 publication-title: J. Algorithms doi: 10.1006/jagm.1999.1063 – ident: 10194_CR45 doi: 10.1016/0022-0000(83)90006-5 – volume: 34 start-page: 4290 issue: 24 year: 2018 ident: 10194_CR19 publication-title: Bioinform. doi: 10.1093/bioinformatics/bty506 – ident: 10194_CR38 doi: 10.1016/S0304-3975(03)00053-7 – volume: 25 start-page: 1 year: 2020 ident: 10194_CR15 publication-title: ACM J. Exp. Algorithmics doi: 10.1145/3385898 – ident: 10194_CR29 doi: 10.4230/LIPIcs.ICALP.2018.62 – ident: 10194_CR14 doi: 10.1145/1998196.1998198 – ident: 10194_CR30 doi: 10.1007/978-3-030-59212-7_6 – volume: 50 start-page: 257 issue: 2 year: 2004 ident: 10194_CR6 publication-title: J. Algorithms doi: 10.1016/S0196-6774(03)00097-X – ident: 10194_CR8 doi: 10.1007/10719839_9 – ident: 10194_CR40 doi: 10.1007/3-540-60044-2_51 – ident: 10194_CR27 doi: 10.4230/LIPIcs.ESA.2020.54 – ident: 10194_CR41 doi: 10.4230/LIPIcs.SEA.2018.16 – ident: 10194_CR7 doi: 10.4230/LIPIcs.CPM.2018.9 – ident: 10194_CR10 doi: 10.4230/LIPIcs.ICALP.2019.21 – ident: 10194_CR25 doi: 10.1109/SFCS.1997.646102 |
| SSID | ssj0003800 |
| Score | 2.3752456 |
| Snippet | An elastic-degenerate (ED) string is a sequence of
n
finite sets of strings of total length
N
, introduced to represent a set of related DNA sequences, also... An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N, introduced to represent a set of related DNA sequences, also... An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N , introduced to represent a set of related DNA sequences, also... |
| SourceID | hal proquest crossref springer |
| SourceType | Open Access Repository Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 1442 |
| SubjectTerms | Algorithms Bioinformatics Combinatorial analysis Computational geometry Computer Science Errors Gene sequencing Pattern matching Rectangles String matching Theory of Computation |
| SummonAdditionalLinks | – databaseName: SpringerLINK Contemporary 1997-Present dbid: RSV link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR3LTsMwzGLAAQ68EeOlCHGDSM2Stqk4TTC0w0CIx7Rb1SUpTIINtYXvx8na8hAggdRD1bhN5Udsx7ENcOhLT_PAht0ZT9BBkYImacCospWgQ82HqWvT2e-Fl5dyMIiuyqSwvDrtXoUk3UpdJ7s5dUNRp-DSga43lQ2YQ3UnrThe3_Tr9ZdLl3iChoDNB_K9MlXm-298UkeNB3sY8oOl-SU46nTO-fL__nYFlkobk7SnTLEKM2a8BstV_wZSivMaLF7UNVvzdTjpoCGNt_TM3Lta1IUhN4WdlCCcO3JJ7K4tYaSTZZOM4HUxyp_s2AbcnXduT7u0bK1AFRogBU1FKCMfjSOeaBTzYciRo8LhUAUtpSMtjfTSyE8ihUCJ9SFtx3QVCu0LLbRhfBNmx5Ox2QJiuBE2mphqlopIM3RLkcpSa_QcDZeqCazCcKzKuuO2_cVjXFdMdriKEVexw1Usm3BUv_M8rbrxK_QBEq4GtAWzu-1ebJ95At0l4ctX1oTdiq5xKaZ5zFnLC9BiY1ETjis6vg__POX238B3YKE1ZQXqsV2YLbIXswfz6rUY5dm-Y9839yTkog priority: 102 providerName: Springer Nature |
| Title | Elastic-Degenerate String Matching with 1 Error or Mismatch |
| URI | https://link.springer.com/article/10.1007/s00224-024-10194-8 https://www.proquest.com/docview/3120689319 https://inria.hal.science/hal-04747458 |
| Volume | 68 |
| WOSCitedRecordID | wos001321482600001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAVX databaseName: SpringerLINK Contemporary 1997-Present customDbUrl: eissn: 1433-0490 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0003800 issn: 1432-4350 databaseCode: RSV dateStart: 19970101 isFulltext: true titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22 providerName: Springer Nature |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpR1db9Mw8EQ3HuCBwWCiY6sixBtYxLFTO-IBbaPTJNaqWmEMXqLUdmDS1m5J2O_nznUyQGIvSNEpSi6JlTvfh893B_Aq1bEVQwq7c1Ggg6IlK8ohZ4YqQSsr5qVv03l6rCYTfXaWTcOCWx22VbYy0QtquzS0Rv5W8CQeonLl2fura0Zdoyi6Glpo9GCdJwknPv-oWCeJhfYpKGgSUGZQGoekGZ8655UXQw2Fgggdeab_UEy9H7Qt8jeb868wqdc-hxv_O-7H8CjYndHeilGewD232ISNtqdDFKb4Jjwcd3Vc66fwboTGNZ6yD-67r0_duGjW0BAjxPPbMCNayY14NKqqZRXhMT6vL-neM_h8OPp0cMRCuwVm0ChpWCmVzlI0mERhcerPlUAuU_O5GSbGZlY7HZdZWmQGkQryK6mLulHSptJK67jYgrXFcuGeQ-SEkxRhLC0vZWY5uqpIeW0tepNOaNMH3v7r3IRa5NQS4yLvqih7-uRIn9zTJ9d9eN09c7WqxHEn9kskYYdIRbSP9o5zuhZLdKFkqm94H3ZamuVh6tb5LcH68Kal-u3tf39y--63vYAHyYrdWMx3YK2pfrpduG9umvO6GkBPffk6gPX90WR6MvBsjHAcHxBMpgTVDOE0_YbwZHb6CzVF9fY |
| linkProvider | ProQuest |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V3Nb9MwFH_aBhJwYDBA6xhgITiBRRw7jSOE0LR16rS2QmKg3UxqO2zS1o4kG-Kf4m_kPTfJAInddkDKIYodR7F_fh9-XwAvEh052Sezu5A5Kiha8bzoC24pE3Tq5LQIZTo_j9LJRB8eZh-W4GcbC0NulS1NDITazS2dkb-RIo76yFxF9v7sG6eqUWRdbUtoLGCx7398R5Wtere3g-v7Mo53BwfbQ95UFeAWeW_NC5XqLEG5QOYOET5NJU5mOp3afmxd5rTXUZEleWaxU07qExULt6lyiXLKeSFx3GW4oRRuB3IVjLY7yi91CHlBEYQikZKoCdIJoXqBWXLkiEj4RKa4_oMRLh-RG-ZvMu5fZtnA7XZX_7d5ugd3G7mabS02wn1Y8rM1WG1rVrCGhK3BnXGXp7Z6AG8HqDzgLd_xX0P-7dqzjzVNCcN-wc2U0Uk1E2xQlvOS4TU-rk6p7SF8upYfegQrs_nMrwPz0iuyoBZOFCpzAlVxRLZ2DrVlL7XtgWjX1tgm1zqV_DgxXZbogAeDeDABD0b34FX3ztki08iVvZ8jZLqOlCR8uDUy9CxSqCKqRF-IHmy2GDENaarMJUB68LpF2WXzvz-5cfVoz-DW8GA8MqO9yf5juB0voM4jsQkrdXnun8BNe1EfV-XTsGkYfLlu9P0Cis5LIQ |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1bb9MwFD7aBkLjgcFgojDAQvAE1uLYaRwhhKa11aZ21SQu2ptJbQcmQTuSMMRf49dxjptkDIm97QEpD1HsOIr9-Vx8bgDPEh052Sezu5A5Kiha8bzoC24pE3Tq5KwIZTo_TNLpVB8fZ0cr8KuNhSG3ypYmBkLtFpbOyHekiKM-MleR7RSNW8TRYPTm9BunClJkaW3LaSwhMvY_f6D6Vr0-GOBaP4_j0fDd3j5vKgxwi3y45oVKdZagjCBzh2ifpRInNp3NbD-2LnPa66jIkjyz2CknVYoKh9tUuUQ55byQOO4qXEMunNAeG6e84wJSh_AXFEcoKimJmoCdELYXGCdH7ohEUGSK6wtMcfUzuWT-Ie_-ZaINnG-08T_P2W241cjbbHe5Qe7Aip9vwkZby4I1pG0Tbh52-Wuru_BqiEoF3vKB_xTycteeva1pehj2C-6njE6wmWDDslyUDK_Dk-ortd2D91fyQ1uwNl_M_X1gXnpFltXCiUJlTqCKjojXzqEW7aW2PRDtOhvb5GCnUiBfTJc9OmDDIDZMwIbRPXjRvXO6zEByae-nCJ-uIyUP39-dGHoWKVQdVaLPRA-2W7yYhmRV5hwsPXjZIu68-d-ffHD5aE_gBoLOTA6m44ewHi9RzyOxDWt1-d0_guv2rD6pysdh_zD4eNXg-w1lfFPH |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Elastic-Degenerate+String+Matching+with+1+Error+or+Mismatch&rft.jtitle=Theory+of+computing+systems&rft.au=Bernardini%2C+Giulia&rft.au=Gabory%2C+Esteban&rft.au=Pissis%2C+Solon+P&rft.au=Stougie%2C+Leen&rft.date=2024-10-01&rft.pub=Springer+Nature+B.V&rft.issn=1432-4350&rft.eissn=1433-0490&rft.volume=68&rft.issue=5&rft.spage=1442&rft.epage=1467&rft_id=info:doi/10.1007%2Fs00224-024-10194-8&rft.externalDBID=HAS_PDF_LINK |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1432-4350&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1432-4350&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1432-4350&client=summon |