Bidirectional String Anchors for Improved Text Indexing and Top- K Similarity Search

The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size of their samples for different combinations of their input parameters. Furthermore, indexes constructed over minimizers samples lack good wo...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:IEEE transactions on knowledge and data engineering Ročník 35; číslo 11; s. 1 - 18
Hlavní autori: Loukides, Grigorios, Pissis, Solon P., Sweering, Michelle
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: IEEE 01.11.2023
Predmet:
ISSN:1041-4347, 1558-2191
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size of their samples for different combinations of their input parameters. Furthermore, indexes constructed over minimizers samples lack good worst-case guarantees for on-line pattern searches. In response, we propose bidirectional string anchors (bd-anchors), a new string sampling mechanism. Given an integer <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>, our mechanism selects the lexicographically smallest rotation in every length-<inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula> fragment. We show that, like minimizers samples, bd-anchors samples are approximately uniform, locally consistent, and computable in linear time. Furthermore, our experiments demonstrate that the bd-anchors sample sizes decrease proportionally to <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>; and that these sizes are competitive to or smaller than the minimizers sample sizes. We theoretically justify these results by analyzing the expected size of bd-anchors samples. We also prove that computing a total order on the input alphabet which minimizes the bd-anchors sample size is NP-hard. We next highlight the benefits of bd-anchors in two important applications: text indexing and top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search. For the first application, we develop an index for performing on-line pattern searches in near-optimal time, and show experimentally that a simple implementation of our index is consistently faster for on-line pattern searches than an analogous implementation of a minimizers-based index; we also show that it is substantially faster than two classic text indexes. For the second application, we develop a heuristic for top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search under edit distance, and show experimentally that it is generally as accurate as the state-of-the-art tool for the same purpose but more than one order of magnitude faster .
AbstractList The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size of their samples for different combinations of their input parameters. Furthermore, indexes constructed over minimizers samples lack good worst-case guarantees for on-line pattern searches. In response, we propose bidirectional string anchors (bd-anchors), a new string sampling mechanism. Given an integer <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>, our mechanism selects the lexicographically smallest rotation in every length-<inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula> fragment. We show that, like minimizers samples, bd-anchors samples are approximately uniform, locally consistent, and computable in linear time. Furthermore, our experiments demonstrate that the bd-anchors sample sizes decrease proportionally to <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>; and that these sizes are competitive to or smaller than the minimizers sample sizes. We theoretically justify these results by analyzing the expected size of bd-anchors samples. We also prove that computing a total order on the input alphabet which minimizes the bd-anchors sample size is NP-hard. We next highlight the benefits of bd-anchors in two important applications: text indexing and top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search. For the first application, we develop an index for performing on-line pattern searches in near-optimal time, and show experimentally that a simple implementation of our index is consistently faster for on-line pattern searches than an analogous implementation of a minimizers-based index; we also show that it is substantially faster than two classic text indexes. For the second application, we develop a heuristic for top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search under edit distance, and show experimentally that it is generally as accurate as the state-of-the-art tool for the same purpose but more than one order of magnitude faster .
Author Sweering, Michelle
Pissis, Solon P.
Loukides, Grigorios
Author_xml – sequence: 1
  givenname: Grigorios
  orcidid: 0000-0003-0888-5061
  surname: Loukides
  fullname: Loukides, Grigorios
  organization: King's College London, U.K
– sequence: 2
  givenname: Solon P.
  orcidid: 0000-0002-1445-1932
  surname: Pissis
  fullname: Pissis, Solon P.
  organization: CWI, Netherlands
– sequence: 3
  givenname: Michelle
  orcidid: 0000-0003-1200-6015
  surname: Sweering
  fullname: Sweering, Michelle
  organization: CWI, The Netherlands
BookMark eNp9kMFOwkAQhjcGEwF9ABMP-wLFme7Wbo-IoAQSD9Rzs92dypqyJdvGwNtLAwfjwdP8mfzfJPON2MA3nhi7R5ggQvaYr17mkxjieCJigamCKzbEJFFRjBkOThkkRlLI9IaN2vYLAFSqcMjyZ2ddINO5xuuab7rg_CeferNtQsurJvDlbh-ab7I8p0PHl97Soa9of9o0-4iv-MbtXK2D6458QzqY7S27rnTd0t1ljtnHYp7P3qL1--tyNl1HRkDWRUohlcJmqrJVkmaJtRVWyiBJoalMKBbyiVBrkjYVxpRCIYDQYHSqSguZGLP0fNeEpm0DVYVxne5f6YJ2dYFQ9HKKXk7Ryykuck4k_iH3we10OP7LPJwZR0S_-oAqVlL8AM9Tctc
CODEN ITKEEH
CitedBy_id crossref_primary_10_1007_s00778_025_00935_7
crossref_primary_10_1186_s13015_025_00270_0
crossref_primary_10_1016_j_softx_2025_102234
Cites_doi 10.1145/2508020.2508023
10.1145/872757.872770
10.1145/2213836.2213847
10.1017/CBO9780511546853
10.1093/bioinformatics/btx235
10.1186/gb-2009-10-3-r25
10.1145/3385898
10.1007/3-540-48194-X_17
10.1145/2093973.2093980
10.1186/gb-2014-15-3-r46
10.1007/978-3-319-43681-4_21
10.1007/978-3-031-04749-7_4
10.4153/CJM-1961-015-3
10.1109/ICDE.2013.6544886
10.1093/bioinformatics/btab313
10.1007/978-3-030-45257-5_3
10.1137/070685373
10.1145/3394486.3403099
10.1145/3313276.3316368
10.1017/CBO9780511809071
10.1145/1007352.1007374
10.1093/bioinformatics/btv022
10.1145/872757.872796
10.1093/bioinformatics/bty258
10.1093/bioinformatics/bty191
10.1145/2588555.2593675
10.1017/CBO9780511574931
10.1093/bioinformatics/btp324
10.1093/bioinformatics/btw279
10.1089/cmb.2018.0036
10.1093/bioinformatics/bty597
10.1109/TKDE.2015.2485213
10.1137/0222058
10.1145/1082036.1082039
10.1007/978-1-4684-2001-2_9
10.1006/jmbi.1990.9999
10.1145/1807167.1807266
10.1002/spe.2481
10.1016/0020-0190(80)90149-0
10.1016/j.cell.2013.09.006
10.1109/ICDE.1995.380415
10.1007/s00453-013-9860-6
10.1145/3375890
10.1145/1998196.1998198
10.1137/S0097539702402354
10.1007/978-3-030-45257-5_13
10.1093/nar/27.11.2369
10.1145/2591796.2591885
10.1093/bioinformatics/bth408
10.1109/SWAT.1973.13
10.1145/3307339.3342144
10.1093/bioinformatics/btw152
10.1007/3-540-08442-8_83
10.1145/828.1884
10.1007/11682462_64
10.1109/TCBB.2021.3136792
10.1093/bioinformatics/btaa472
10.1007/978-3-540-77974-2
10.1145/1217856.1217858
10.1006/jagm.2000.1104
10.1093/bioinformatics/btaa435
10.1089/cmb.2021.0599
10.1007/978-3-319-07959-2_28
10.14778/2732219.2732220
10.1609/aaai.v24i1.7527
10.1016/j.ic.2019.104462
10.1093/bioinformatics/btab156
10.1137/1.9781611974331.ch143
10.1145/509907.509915
10.7717/peerj.10805
10.1109/SFCS.1997.646102
10.1089/cmb.2020.0432
10.1137/1.9781611975499.13
10.1147/rd.312.0249
10.1007/s00778-016-0449-y
ContentType Journal Article
DBID 97E
RIA
RIE
AAYXX
CITATION
DOI 10.1109/TKDE.2022.3231780
DatabaseName IEEE Xplore (IEEE)
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE/IET Electronic Library (IEL) (UW System Shared)
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL) (UW System Shared)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1558-2191
EndPage 18
ExternalDocumentID 10_1109_TKDE_2022_3231780
10018284
Genre orig-research
GroupedDBID -~X
.DC
0R~
29I
4.4
5GY
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABQJQ
ABVLG
ACGFO
ACIWK
AENEX
AGQYO
AHBIQ
AKJIK
AKQYR
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
EJD
F5P
HZ~
IEDLZ
IFIPE
IPLJI
JAVBF
LAI
M43
MS~
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNS
RXW
TAE
TN5
UHB
AAYXX
CITATION
ID FETCH-LOGICAL-c309t-881eb3d98fdf5795ddf1f8c1e43aeb5e2346e1aae4d73ccb381003a0ca78bd093
IEDL.DBID RIE
ISICitedReferencesCount 7
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001089176900014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1041-4347
IngestDate Sat Nov 29 02:36:06 EST 2025
Tue Nov 18 21:35:28 EST 2025
Wed Aug 27 02:21:29 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 11
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c309t-881eb3d98fdf5795ddf1f8c1e43aeb5e2346e1aae4d73ccb381003a0ca78bd093
ORCID 0000-0002-1445-1932
0000-0003-1200-6015
0000-0003-0888-5061
OpenAccessLink https://hdl.handle.net/1871.1/ec28e28b-b166-4438-88d6-0aaf6e55e899
PageCount 18
ParticipantIDs crossref_citationtrail_10_1109_TKDE_2022_3231780
crossref_primary_10_1109_TKDE_2022_3231780
ieee_primary_10018284
PublicationCentury 2000
PublicationDate 2023-11-01
PublicationDateYYYYMMDD 2023-11-01
PublicationDate_xml – month: 11
  year: 2023
  text: 2023-11-01
  day: 01
PublicationDecade 2020
PublicationTitle IEEE transactions on knowledge and data engineering
PublicationTitleAbbrev TKDE
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
References ref13
ref57
ref12
ref15
ref59
ref14
ref58
ref11
ref10
ref54
(ref76) 2021
ref17
ref16
ref19
ref18
ref51
ref50
ref46
ref89
ref48
ref47
ref42
ref86
ref41
ref85
ref44
ref88
ref43
ref87
li (ref56) 2007
ref49
gao (ref34) 2020
loukides (ref60) 2021
ref8
ref7
ref9
ref4
ref3
ref6
ref82
ref81
ref40
ref84
ref83
korf (ref53) 2003
ref80
ref35
ref79
ref78
ref37
ref36
dinklage (ref23) 2020
ref31
ref75
ref74
ref33
ref77
munro (ref66) 2017
kahveci (ref45) 0
ref32
ref2
ref1
ref39
ref38
ramakrishnan (ref70) 2003
ref71
ref73
ref72
ref24
ref68
ref67
levenshtein (ref55) 1966; 10
ref26
ref25
ref69
ref20
ref64
ref63
ref22
ref21
ref65
ref28
fisikopoulos (ref30) 2011
ref27
ferragina (ref29) 0
ref62
ref61
baeza-yates (ref5) 2011
kociumaka (ref52) 2016
References_xml – ident: ref69
  doi: 10.1145/2508020.2508023
– ident: ref73
  doi: 10.1145/872757.872770
– ident: ref77
  doi: 10.1145/2213836.2213847
– ident: ref17
  doi: 10.1017/CBO9780511546853
– ident: ref65
  doi: 10.1093/bioinformatics/btx235
– year: 2003
  ident: ref70
  publication-title: Database Management Systems (3 ed)
– ident: ref54
  doi: 10.1186/gb-2009-10-3-r25
– ident: ref12
  doi: 10.1145/3385898
– ident: ref49
  doi: 10.1007/3-540-48194-X_17
– ident: ref82
  doi: 10.1145/2093973.2093980
– ident: ref80
  doi: 10.1186/gb-2014-15-3-r46
– ident: ref68
  doi: 10.1007/978-3-319-43681-4_21
– start-page: 351
  year: 0
  ident: ref45
  article-title: Efficient index structures for string databases
  publication-title: Proc Int Conf Very Large Data Bases
– ident: ref39
  doi: 10.1007/978-3-031-04749-7_4
– ident: ref72
  doi: 10.4153/CJM-1961-015-3
– ident: ref21
  doi: 10.1109/ICDE.2013.6544886
– ident: ref89
  doi: 10.1093/bioinformatics/btab313
– ident: ref26
  doi: 10.1007/978-3-030-45257-5_3
– ident: ref40
  doi: 10.1137/070685373
– ident: ref84
  doi: 10.1145/3394486.3403099
– ident: ref50
  doi: 10.1145/3313276.3316368
– ident: ref63
  doi: 10.1017/CBO9780511809071
– start-page: 64:1
  year: 2021
  ident: ref60
  article-title: Bidirectional string anchors: A new string sampling mechanism
  publication-title: Proc Annu Eur Symp Algorithms
– ident: ref15
  doi: 10.1145/1007352.1007374
– ident: ref22
  doi: 10.1093/bioinformatics/btv022
– ident: ref13
  doi: 10.1145/872757.872796
– ident: ref64
  doi: 10.1093/bioinformatics/bty258
– ident: ref58
  doi: 10.1093/bioinformatics/bty191
– start-page: 39:1
  year: 2020
  ident: ref23
  article-title: Practical performance of space efficient data structures for longest common extensions
  publication-title: Proc Annu Eur Symp Algorithms
– ident: ref20
  doi: 10.1145/2588555.2593675
– ident: ref38
  doi: 10.1017/CBO9780511574931
– ident: ref59
  doi: 10.1093/bioinformatics/btp324
– ident: ref14
  doi: 10.1093/bioinformatics/btw279
– ident: ref42
  doi: 10.1089/cmb.2018.0036
– ident: ref43
  doi: 10.1093/bioinformatics/bty597
– ident: ref41
  doi: 10.1109/TKDE.2015.2485213
– ident: ref62
  doi: 10.1137/0222058
– year: 2011
  ident: ref30
  article-title: An implementation of range trees with fractional cascading in C
– ident: ref28
  doi: 10.1145/1082036.1082039
– ident: ref47
  doi: 10.1007/978-1-4684-2001-2_9
– year: 2011
  ident: ref5
  publication-title: Modern Information Retrieval - The Concepts and Technology behind Search
– start-page: 303
  year: 2007
  ident: ref56
  article-title: VGRAM: Improving performance of approximate queries on string collections using variable-length grams
  publication-title: Proc Int Conf Very Large Data Bases
– ident: ref2
  doi: 10.1006/jmbi.1990.9999
– ident: ref85
  doi: 10.1145/1807167.1807266
– ident: ref36
  doi: 10.1002/spe.2481
– ident: ref10
  doi: 10.1016/0020-0190(80)90149-0
– ident: ref51
  doi: 10.1016/j.cell.2013.09.006
– ident: ref1
  doi: 10.1109/ICDE.1995.380415
– ident: ref16
  doi: 10.1007/s00453-013-9860-6
– ident: ref33
  doi: 10.1145/3375890
– ident: ref11
  doi: 10.1145/1998196.1998198
– start-page: 54:1
  year: 2020
  ident: ref34
  article-title: Fast preprocessing for optimal orthogonal range reporting and range successor with applications to text indexing
  publication-title: Proc Annu Eur Symp Algorithms
– ident: ref37
  doi: 10.1137/S0097539702402354
– ident: ref87
  doi: 10.1007/978-3-030-45257-5_13
– start-page: 28:1
  year: 2016
  ident: ref52
  article-title: Minimal suffix and rotation of a substring in optimal time
  publication-title: Proc Ann Symp Combinatorial Pattern Matching
– volume: 10
  start-page: 707
  year: 1966
  ident: ref55
  article-title: Binary codes capable of correcting deletions, insertions and reversals
  publication-title: Sov Phys Doklady
– ident: ref19
  doi: 10.1093/nar/27.11.2369
– ident: ref7
  doi: 10.1145/2591796.2591885
– ident: ref71
  doi: 10.1093/bioinformatics/bth408
– ident: ref79
  doi: 10.1109/SWAT.1973.13
– ident: ref18
  doi: 10.1145/3307339.3342144
– ident: ref57
  doi: 10.1093/bioinformatics/btw152
– ident: ref74
  doi: 10.1007/3-540-08442-8_83
– ident: ref32
  doi: 10.1145/828.1884
– ident: ref61
  doi: 10.1007/11682462_64
– ident: ref4
  doi: 10.1109/TCBB.2021.3136792
– ident: ref86
  doi: 10.1093/bioinformatics/btaa472
– ident: ref9
  doi: 10.1007/978-3-540-77974-2
– ident: ref46
  doi: 10.1145/1217856.1217858
– ident: ref3
  doi: 10.1006/jagm.2000.1104
– year: 2003
  ident: ref53
  publication-title: BLAST - An Essential Guide to the Basic Local Alignment Search Tool
– ident: ref44
  doi: 10.1093/bioinformatics/btaa435
– ident: ref31
  doi: 10.1089/cmb.2021.0599
– year: 0
  ident: ref29
  article-title: Pizza&Chili corpus-compressed indexes and their testbeds
– ident: ref35
  doi: 10.1007/978-3-319-07959-2_28
– ident: ref78
  doi: 10.14778/2732219.2732220
– ident: ref81
  doi: 10.1609/aaai.v24i1.7527
– year: 2021
  ident: ref76
  publication-title: CGAL Editorial Board
– ident: ref6
  doi: 10.1016/j.ic.2019.104462
– ident: ref67
  doi: 10.1093/bioinformatics/btab156
– ident: ref8
  doi: 10.1137/1.9781611974331.ch143
– ident: ref24
  doi: 10.1145/509907.509915
– ident: ref25
  doi: 10.7717/peerj.10805
– ident: ref27
  doi: 10.1109/SFCS.1997.646102
– ident: ref88
  doi: 10.1089/cmb.2020.0432
– ident: ref75
  doi: 10.1137/1.9781611975499.13
– ident: ref48
  doi: 10.1147/rd.312.0249
– start-page: 408
  year: 2017
  ident: ref66
  article-title: Space-efficient construction of compressed indexes in deterministic linear time
  publication-title: Proceedings of the 5th Annual ACM-SIAM Symposium on Discrete Algorithms
– ident: ref83
  doi: 10.1007/s00778-016-0449-y
SSID ssj0008781
Score 2.492749
Snippet The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size...
SourceID crossref
ieee
SourceType Enrichment Source
Index Database
Publisher
StartPage 1
SubjectTerms Bioinformatics
Indexing
Random access memory
Search problems
Software
string algorithms
string sampling
Task analysis
text indexing
top-<inline-formula xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> <tex-math notation="LaTeX"> K</tex-math> </inline-formula> string similarity search
Upper bound
Title Bidirectional String Anchors for Improved Text Indexing and Top- K Similarity Search
URI https://ieeexplore.ieee.org/document/10018284
Volume 35
WOSCitedRecordID wos001089176900014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE/IET Electronic Library (IEL) (UW System Shared)
  customDbUrl:
  eissn: 1558-2191
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0008781
  issn: 1041-4347
  databaseCode: RIE
  dateStart: 19890101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFA46POjB6Zw4f5GDJ6Fb0nRNepy6oQyGsAm7lSQvgYF2Yz_8-02abuyi4KWU8EpLX_vyvpe870PogRgQkMQOm6iuOwhqIyktjcBlczxTLkOGIDbBRyMxnWbvVbN62QtjjCk3n5m2Py3X8mGuN75U1vF8QQ4hJIfokHMemrV2YVfwUpHUwQt3T5bwagmTkqwzGb70HRSM4zZz6Qz3FJB7k9Ceqko5qQzq_3ycM3RaZY-4F9x9jg5M0UD1rTIDrn7UBjrZoxm8QJOnWZi5yrIfHq_9MO4VLvItV9hlrTiUFgzgiYvV-M0zKHoTWbiR-SLCQzyefc0cCHY5Ow47lJvoY9CfPL9GlZpCpBnJ1pEQ1AFnyIQF2-VZF8BSKzQ1CZNGdU3MktRQKU0CnGmtPPUXYZJoyYUCkrFLVCvmhblCmMSSpSoFIi0kinAFVmkAYWlCpWBpC5Ht6811RTXuFS8-8xJykCz3Hsm9R_LKIy30uLtkEXg2_jJuem_sGQZHXP8yfoOOvUp8aCG8RbX1cmPu0JH-Xs9Wy_vyM_oBYATFHA
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NT9wwEB21FKn0UAoFQcuHDz0hBezYWTtHyodAS1eVCBK3yPbY0kptFi0Lv7_jOKC9tFIvUWRN5MiT2PPGnvcAvvGABlVJ2MRVdDEiFtZGUSBFc7p2FCFjFpvQk4m5v69_DsXqfS1MCKE_fBaO022_l48z_5RSZSeJL4gQgnoL7ypFHeRyrdeJ1-hek5QABvUqlR42MQWvT5rx-QWBwbI8lhTQ6EQCubQMLemq9MvK5fp_vtAn-DjEj-w0O3wD3oRuE9ZftBnY8KtuwoclosHP0Hyf5rWrT_yx20VqZqcdzX3zR0ZxK8vJhYCsodmaXScOxWRiO2qZPRRszG6nv6cEgylqZ_mM8hbcXV40Z1fFoKdQeMnrRWGMIOiMtYkYK11XiFFE40VQ0gZXhVKqURDWBoVaeu8S-ReXlnurjUNey21Y6WZd2AHGSytHboTcRlSOa4fReUQThRLWyNEu8Jfhbf1ANp40L361PejgdZs80iaPtINHduHo9ZGHzLTxL-Ot5I0lw-yIL39pP4T3V82Pm_bmejL-CmtJMz4XFO7BymL-FPZh1T8vpo_zg_6T-gPME8hj
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bidirectional+String+Anchors+for+Improved+Text+Indexing+and+Top-%24K%24+Similarity+Search&rft.jtitle=IEEE+transactions+on+knowledge+and+data+engineering&rft.au=Loukides%2C+Grigorios&rft.au=Pissis%2C+Solon+P.&rft.au=Sweering%2C+Michelle&rft.date=2023-11-01&rft.issn=1041-4347&rft.eissn=1558-2191&rft.volume=35&rft.issue=11&rft.spage=11093&rft.epage=11111&rft_id=info:doi/10.1109%2FTKDE.2022.3231780&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TKDE_2022_3231780
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1041-4347&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1041-4347&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1041-4347&client=summon