Bidirectional String Anchors for Improved Text Indexing and Top- K Similarity Search
The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size of their samples for different combinations of their input parameters. Furthermore, indexes constructed over minimizers samples lack good wo...
Uložené v:
| Vydané v: | IEEE transactions on knowledge and data engineering Ročník 35; číslo 11; s. 1 - 18 |
|---|---|
| Hlavní autori: | , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
IEEE
01.11.2023
|
| Predmet: | |
| ISSN: | 1041-4347, 1558-2191 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size of their samples for different combinations of their input parameters. Furthermore, indexes constructed over minimizers samples lack good worst-case guarantees for on-line pattern searches. In response, we propose bidirectional string anchors (bd-anchors), a new string sampling mechanism. Given an integer <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>, our mechanism selects the lexicographically smallest rotation in every length-<inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula> fragment. We show that, like minimizers samples, bd-anchors samples are approximately uniform, locally consistent, and computable in linear time. Furthermore, our experiments demonstrate that the bd-anchors sample sizes decrease proportionally to <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>; and that these sizes are competitive to or smaller than the minimizers sample sizes. We theoretically justify these results by analyzing the expected size of bd-anchors samples. We also prove that computing a total order on the input alphabet which minimizes the bd-anchors sample size is NP-hard. We next highlight the benefits of bd-anchors in two important applications: text indexing and top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search. For the first application, we develop an index for performing on-line pattern searches in near-optimal time, and show experimentally that a simple implementation of our index is consistently faster for on-line pattern searches than an analogous implementation of a minimizers-based index; we also show that it is substantially faster than two classic text indexes. For the second application, we develop a heuristic for top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search under edit distance, and show experimentally that it is generally as accurate as the state-of-the-art tool for the same purpose but more than one order of magnitude faster . |
|---|---|
| AbstractList | The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size of their samples for different combinations of their input parameters. Furthermore, indexes constructed over minimizers samples lack good worst-case guarantees for on-line pattern searches. In response, we propose bidirectional string anchors (bd-anchors), a new string sampling mechanism. Given an integer <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>, our mechanism selects the lexicographically smallest rotation in every length-<inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula> fragment. We show that, like minimizers samples, bd-anchors samples are approximately uniform, locally consistent, and computable in linear time. Furthermore, our experiments demonstrate that the bd-anchors sample sizes decrease proportionally to <inline-formula><tex-math notation="LaTeX">\ell</tex-math></inline-formula>; and that these sizes are competitive to or smaller than the minimizers sample sizes. We theoretically justify these results by analyzing the expected size of bd-anchors samples. We also prove that computing a total order on the input alphabet which minimizes the bd-anchors sample size is NP-hard. We next highlight the benefits of bd-anchors in two important applications: text indexing and top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search. For the first application, we develop an index for performing on-line pattern searches in near-optimal time, and show experimentally that a simple implementation of our index is consistently faster for on-line pattern searches than an analogous implementation of a minimizers-based index; we also show that it is substantially faster than two classic text indexes. For the second application, we develop a heuristic for top-<inline-formula><tex-math notation="LaTeX">K</tex-math></inline-formula> similarity search under edit distance, and show experimentally that it is generally as accurate as the state-of-the-art tool for the same purpose but more than one order of magnitude faster . |
| Author | Sweering, Michelle Pissis, Solon P. Loukides, Grigorios |
| Author_xml | – sequence: 1 givenname: Grigorios orcidid: 0000-0003-0888-5061 surname: Loukides fullname: Loukides, Grigorios organization: King's College London, U.K – sequence: 2 givenname: Solon P. orcidid: 0000-0002-1445-1932 surname: Pissis fullname: Pissis, Solon P. organization: CWI, Netherlands – sequence: 3 givenname: Michelle orcidid: 0000-0003-1200-6015 surname: Sweering fullname: Sweering, Michelle organization: CWI, The Netherlands |
| BookMark | eNp9kMFOwkAQhjcGEwF9ABMP-wLFme7Wbo-IoAQSD9Rzs92dypqyJdvGwNtLAwfjwdP8mfzfJPON2MA3nhi7R5ggQvaYr17mkxjieCJigamCKzbEJFFRjBkOThkkRlLI9IaN2vYLAFSqcMjyZ2ddINO5xuuab7rg_CeferNtQsurJvDlbh-ab7I8p0PHl97Soa9of9o0-4iv-MbtXK2D6458QzqY7S27rnTd0t1ljtnHYp7P3qL1--tyNl1HRkDWRUohlcJmqrJVkmaJtRVWyiBJoalMKBbyiVBrkjYVxpRCIYDQYHSqSguZGLP0fNeEpm0DVYVxne5f6YJ2dYFQ9HKKXk7Ryykuck4k_iH3we10OP7LPJwZR0S_-oAqVlL8AM9Tctc |
| CODEN | ITKEEH |
| CitedBy_id | crossref_primary_10_1007_s00778_025_00935_7 crossref_primary_10_1186_s13015_025_00270_0 crossref_primary_10_1016_j_softx_2025_102234 |
| Cites_doi | 10.1145/2508020.2508023 10.1145/872757.872770 10.1145/2213836.2213847 10.1017/CBO9780511546853 10.1093/bioinformatics/btx235 10.1186/gb-2009-10-3-r25 10.1145/3385898 10.1007/3-540-48194-X_17 10.1145/2093973.2093980 10.1186/gb-2014-15-3-r46 10.1007/978-3-319-43681-4_21 10.1007/978-3-031-04749-7_4 10.4153/CJM-1961-015-3 10.1109/ICDE.2013.6544886 10.1093/bioinformatics/btab313 10.1007/978-3-030-45257-5_3 10.1137/070685373 10.1145/3394486.3403099 10.1145/3313276.3316368 10.1017/CBO9780511809071 10.1145/1007352.1007374 10.1093/bioinformatics/btv022 10.1145/872757.872796 10.1093/bioinformatics/bty258 10.1093/bioinformatics/bty191 10.1145/2588555.2593675 10.1017/CBO9780511574931 10.1093/bioinformatics/btp324 10.1093/bioinformatics/btw279 10.1089/cmb.2018.0036 10.1093/bioinformatics/bty597 10.1109/TKDE.2015.2485213 10.1137/0222058 10.1145/1082036.1082039 10.1007/978-1-4684-2001-2_9 10.1006/jmbi.1990.9999 10.1145/1807167.1807266 10.1002/spe.2481 10.1016/0020-0190(80)90149-0 10.1016/j.cell.2013.09.006 10.1109/ICDE.1995.380415 10.1007/s00453-013-9860-6 10.1145/3375890 10.1145/1998196.1998198 10.1137/S0097539702402354 10.1007/978-3-030-45257-5_13 10.1093/nar/27.11.2369 10.1145/2591796.2591885 10.1093/bioinformatics/bth408 10.1109/SWAT.1973.13 10.1145/3307339.3342144 10.1093/bioinformatics/btw152 10.1007/3-540-08442-8_83 10.1145/828.1884 10.1007/11682462_64 10.1109/TCBB.2021.3136792 10.1093/bioinformatics/btaa472 10.1007/978-3-540-77974-2 10.1145/1217856.1217858 10.1006/jagm.2000.1104 10.1093/bioinformatics/btaa435 10.1089/cmb.2021.0599 10.1007/978-3-319-07959-2_28 10.14778/2732219.2732220 10.1609/aaai.v24i1.7527 10.1016/j.ic.2019.104462 10.1093/bioinformatics/btab156 10.1137/1.9781611974331.ch143 10.1145/509907.509915 10.7717/peerj.10805 10.1109/SFCS.1997.646102 10.1089/cmb.2020.0432 10.1137/1.9781611975499.13 10.1147/rd.312.0249 10.1007/s00778-016-0449-y |
| ContentType | Journal Article |
| DBID | 97E RIA RIE AAYXX CITATION |
| DOI | 10.1109/TKDE.2022.3231780 |
| DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE/IET Electronic Library (IEL) (UW System Shared) CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering Computer Science |
| EISSN | 1558-2191 |
| EndPage | 18 |
| ExternalDocumentID | 10_1109_TKDE_2022_3231780 10018284 |
| Genre | orig-research |
| GroupedDBID | -~X .DC 0R~ 29I 4.4 5GY 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFO ACIWK AENEX AGQYO AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD F5P HZ~ IEDLZ IFIPE IPLJI JAVBF LAI M43 MS~ O9- OCL P2P PQQKQ RIA RIE RNS RXW TAE TN5 UHB AAYXX CITATION |
| ID | FETCH-LOGICAL-c309t-881eb3d98fdf5795ddf1f8c1e43aeb5e2346e1aae4d73ccb381003a0ca78bd093 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 7 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001089176900014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1041-4347 |
| IngestDate | Sat Nov 29 02:36:06 EST 2025 Tue Nov 18 21:35:28 EST 2025 Wed Aug 27 02:21:29 EDT 2025 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 11 |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c309t-881eb3d98fdf5795ddf1f8c1e43aeb5e2346e1aae4d73ccb381003a0ca78bd093 |
| ORCID | 0000-0002-1445-1932 0000-0003-1200-6015 0000-0003-0888-5061 |
| OpenAccessLink | https://hdl.handle.net/1871.1/ec28e28b-b166-4438-88d6-0aaf6e55e899 |
| PageCount | 18 |
| ParticipantIDs | crossref_citationtrail_10_1109_TKDE_2022_3231780 crossref_primary_10_1109_TKDE_2022_3231780 ieee_primary_10018284 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-11-01 |
| PublicationDateYYYYMMDD | 2023-11-01 |
| PublicationDate_xml | – month: 11 year: 2023 text: 2023-11-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE transactions on knowledge and data engineering |
| PublicationTitleAbbrev | TKDE |
| PublicationYear | 2023 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| References | ref13 ref57 ref12 ref15 ref59 ref14 ref58 ref11 ref10 ref54 (ref76) 2021 ref17 ref16 ref19 ref18 ref51 ref50 ref46 ref89 ref48 ref47 ref42 ref86 ref41 ref85 ref44 ref88 ref43 ref87 li (ref56) 2007 ref49 gao (ref34) 2020 loukides (ref60) 2021 ref8 ref7 ref9 ref4 ref3 ref6 ref82 ref81 ref40 ref84 ref83 korf (ref53) 2003 ref80 ref35 ref79 ref78 ref37 ref36 dinklage (ref23) 2020 ref31 ref75 ref74 ref33 ref77 munro (ref66) 2017 kahveci (ref45) 0 ref32 ref2 ref1 ref39 ref38 ramakrishnan (ref70) 2003 ref71 ref73 ref72 ref24 ref68 ref67 levenshtein (ref55) 1966; 10 ref26 ref25 ref69 ref20 ref64 ref63 ref22 ref21 ref65 ref28 fisikopoulos (ref30) 2011 ref27 ferragina (ref29) 0 ref62 ref61 baeza-yates (ref5) 2011 kociumaka (ref52) 2016 |
| References_xml | – ident: ref69 doi: 10.1145/2508020.2508023 – ident: ref73 doi: 10.1145/872757.872770 – ident: ref77 doi: 10.1145/2213836.2213847 – ident: ref17 doi: 10.1017/CBO9780511546853 – ident: ref65 doi: 10.1093/bioinformatics/btx235 – year: 2003 ident: ref70 publication-title: Database Management Systems (3 ed) – ident: ref54 doi: 10.1186/gb-2009-10-3-r25 – ident: ref12 doi: 10.1145/3385898 – ident: ref49 doi: 10.1007/3-540-48194-X_17 – ident: ref82 doi: 10.1145/2093973.2093980 – ident: ref80 doi: 10.1186/gb-2014-15-3-r46 – ident: ref68 doi: 10.1007/978-3-319-43681-4_21 – start-page: 351 year: 0 ident: ref45 article-title: Efficient index structures for string databases publication-title: Proc Int Conf Very Large Data Bases – ident: ref39 doi: 10.1007/978-3-031-04749-7_4 – ident: ref72 doi: 10.4153/CJM-1961-015-3 – ident: ref21 doi: 10.1109/ICDE.2013.6544886 – ident: ref89 doi: 10.1093/bioinformatics/btab313 – ident: ref26 doi: 10.1007/978-3-030-45257-5_3 – ident: ref40 doi: 10.1137/070685373 – ident: ref84 doi: 10.1145/3394486.3403099 – ident: ref50 doi: 10.1145/3313276.3316368 – ident: ref63 doi: 10.1017/CBO9780511809071 – start-page: 64:1 year: 2021 ident: ref60 article-title: Bidirectional string anchors: A new string sampling mechanism publication-title: Proc Annu Eur Symp Algorithms – ident: ref15 doi: 10.1145/1007352.1007374 – ident: ref22 doi: 10.1093/bioinformatics/btv022 – ident: ref13 doi: 10.1145/872757.872796 – ident: ref64 doi: 10.1093/bioinformatics/bty258 – ident: ref58 doi: 10.1093/bioinformatics/bty191 – start-page: 39:1 year: 2020 ident: ref23 article-title: Practical performance of space efficient data structures for longest common extensions publication-title: Proc Annu Eur Symp Algorithms – ident: ref20 doi: 10.1145/2588555.2593675 – ident: ref38 doi: 10.1017/CBO9780511574931 – ident: ref59 doi: 10.1093/bioinformatics/btp324 – ident: ref14 doi: 10.1093/bioinformatics/btw279 – ident: ref42 doi: 10.1089/cmb.2018.0036 – ident: ref43 doi: 10.1093/bioinformatics/bty597 – ident: ref41 doi: 10.1109/TKDE.2015.2485213 – ident: ref62 doi: 10.1137/0222058 – year: 2011 ident: ref30 article-title: An implementation of range trees with fractional cascading in C – ident: ref28 doi: 10.1145/1082036.1082039 – ident: ref47 doi: 10.1007/978-1-4684-2001-2_9 – year: 2011 ident: ref5 publication-title: Modern Information Retrieval - The Concepts and Technology behind Search – start-page: 303 year: 2007 ident: ref56 article-title: VGRAM: Improving performance of approximate queries on string collections using variable-length grams publication-title: Proc Int Conf Very Large Data Bases – ident: ref2 doi: 10.1006/jmbi.1990.9999 – ident: ref85 doi: 10.1145/1807167.1807266 – ident: ref36 doi: 10.1002/spe.2481 – ident: ref10 doi: 10.1016/0020-0190(80)90149-0 – ident: ref51 doi: 10.1016/j.cell.2013.09.006 – ident: ref1 doi: 10.1109/ICDE.1995.380415 – ident: ref16 doi: 10.1007/s00453-013-9860-6 – ident: ref33 doi: 10.1145/3375890 – ident: ref11 doi: 10.1145/1998196.1998198 – start-page: 54:1 year: 2020 ident: ref34 article-title: Fast preprocessing for optimal orthogonal range reporting and range successor with applications to text indexing publication-title: Proc Annu Eur Symp Algorithms – ident: ref37 doi: 10.1137/S0097539702402354 – ident: ref87 doi: 10.1007/978-3-030-45257-5_13 – start-page: 28:1 year: 2016 ident: ref52 article-title: Minimal suffix and rotation of a substring in optimal time publication-title: Proc Ann Symp Combinatorial Pattern Matching – volume: 10 start-page: 707 year: 1966 ident: ref55 article-title: Binary codes capable of correcting deletions, insertions and reversals publication-title: Sov Phys Doklady – ident: ref19 doi: 10.1093/nar/27.11.2369 – ident: ref7 doi: 10.1145/2591796.2591885 – ident: ref71 doi: 10.1093/bioinformatics/bth408 – ident: ref79 doi: 10.1109/SWAT.1973.13 – ident: ref18 doi: 10.1145/3307339.3342144 – ident: ref57 doi: 10.1093/bioinformatics/btw152 – ident: ref74 doi: 10.1007/3-540-08442-8_83 – ident: ref32 doi: 10.1145/828.1884 – ident: ref61 doi: 10.1007/11682462_64 – ident: ref4 doi: 10.1109/TCBB.2021.3136792 – ident: ref86 doi: 10.1093/bioinformatics/btaa472 – ident: ref9 doi: 10.1007/978-3-540-77974-2 – ident: ref46 doi: 10.1145/1217856.1217858 – ident: ref3 doi: 10.1006/jagm.2000.1104 – year: 2003 ident: ref53 publication-title: BLAST - An Essential Guide to the Basic Local Alignment Search Tool – ident: ref44 doi: 10.1093/bioinformatics/btaa435 – ident: ref31 doi: 10.1089/cmb.2021.0599 – year: 0 ident: ref29 article-title: Pizza&Chili corpus-compressed indexes and their testbeds – ident: ref35 doi: 10.1007/978-3-319-07959-2_28 – ident: ref78 doi: 10.14778/2732219.2732220 – ident: ref81 doi: 10.1609/aaai.v24i1.7527 – year: 2021 ident: ref76 publication-title: CGAL Editorial Board – ident: ref6 doi: 10.1016/j.ic.2019.104462 – ident: ref67 doi: 10.1093/bioinformatics/btab156 – ident: ref8 doi: 10.1137/1.9781611974331.ch143 – ident: ref24 doi: 10.1145/509907.509915 – ident: ref25 doi: 10.7717/peerj.10805 – ident: ref27 doi: 10.1109/SFCS.1997.646102 – ident: ref88 doi: 10.1089/cmb.2020.0432 – ident: ref75 doi: 10.1137/1.9781611975499.13 – ident: ref48 doi: 10.1147/rd.312.0249 – start-page: 408 year: 2017 ident: ref66 article-title: Space-efficient construction of compressed indexes in deterministic linear time publication-title: Proceedings of the 5th Annual ACM-SIAM Symposium on Discrete Algorithms – ident: ref83 doi: 10.1007/s00778-016-0449-y |
| SSID | ssj0008781 |
| Score | 2.492749 |
| Snippet | The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sampling mechanisms lack good guarantees on the expected size... |
| SourceID | crossref ieee |
| SourceType | Enrichment Source Index Database Publisher |
| StartPage | 1 |
| SubjectTerms | Bioinformatics Indexing Random access memory Search problems Software string algorithms string sampling Task analysis text indexing top-<inline-formula xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> <tex-math notation="LaTeX"> K</tex-math> </inline-formula> string similarity search Upper bound |
| Title | Bidirectional String Anchors for Improved Text Indexing and Top- K Similarity Search |
| URI | https://ieeexplore.ieee.org/document/10018284 |
| Volume | 35 |
| WOSCitedRecordID | wos001089176900014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE/IET Electronic Library (IEL) (UW System Shared) customDbUrl: eissn: 1558-2191 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0008781 issn: 1041-4347 databaseCode: RIE dateStart: 19890101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFA46POjB6Zw4f5GDJ6Fb0nRNepy6oQyGsAm7lSQvgYF2Yz_8-02abuyi4KWU8EpLX_vyvpe870PogRgQkMQOm6iuOwhqIyktjcBlczxTLkOGIDbBRyMxnWbvVbN62QtjjCk3n5m2Py3X8mGuN75U1vF8QQ4hJIfokHMemrV2YVfwUpHUwQt3T5bwagmTkqwzGb70HRSM4zZz6Qz3FJB7k9Ceqko5qQzq_3ycM3RaZY-4F9x9jg5M0UD1rTIDrn7UBjrZoxm8QJOnWZi5yrIfHq_9MO4VLvItV9hlrTiUFgzgiYvV-M0zKHoTWbiR-SLCQzyefc0cCHY5Ow47lJvoY9CfPL9GlZpCpBnJ1pEQ1AFnyIQF2-VZF8BSKzQ1CZNGdU3MktRQKU0CnGmtPPUXYZJoyYUCkrFLVCvmhblCmMSSpSoFIi0kinAFVmkAYWlCpWBpC5Ht6811RTXuFS8-8xJykCz3Hsm9R_LKIy30uLtkEXg2_jJuem_sGQZHXP8yfoOOvUp8aCG8RbX1cmPu0JH-Xs9Wy_vyM_oBYATFHA |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NT9wwEB21FKn0UAoFQcuHDz0hBezYWTtHyodAS1eVCBK3yPbY0kptFi0Lv7_jOKC9tFIvUWRN5MiT2PPGnvcAvvGABlVJ2MRVdDEiFtZGUSBFc7p2FCFjFpvQk4m5v69_DsXqfS1MCKE_fBaO022_l48z_5RSZSeJL4gQgnoL7ypFHeRyrdeJ1-hek5QABvUqlR42MQWvT5rx-QWBwbI8lhTQ6EQCubQMLemq9MvK5fp_vtAn-DjEj-w0O3wD3oRuE9ZftBnY8KtuwoclosHP0Hyf5rWrT_yx20VqZqcdzX3zR0ZxK8vJhYCsodmaXScOxWRiO2qZPRRszG6nv6cEgylqZ_mM8hbcXV40Z1fFoKdQeMnrRWGMIOiMtYkYK11XiFFE40VQ0gZXhVKqURDWBoVaeu8S-ReXlnurjUNey21Y6WZd2AHGSytHboTcRlSOa4fReUQThRLWyNEu8Jfhbf1ANp40L361PejgdZs80iaPtINHduHo9ZGHzLTxL-Ot5I0lw-yIL39pP4T3V82Pm_bmejL-CmtJMz4XFO7BymL-FPZh1T8vpo_zg_6T-gPME8hj |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bidirectional+String+Anchors+for+Improved+Text+Indexing+and+Top-%24K%24+Similarity+Search&rft.jtitle=IEEE+transactions+on+knowledge+and+data+engineering&rft.au=Loukides%2C+Grigorios&rft.au=Pissis%2C+Solon+P.&rft.au=Sweering%2C+Michelle&rft.date=2023-11-01&rft.issn=1041-4347&rft.eissn=1558-2191&rft.volume=35&rft.issue=11&rft.spage=11093&rft.epage=11111&rft_id=info:doi/10.1109%2FTKDE.2022.3231780&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TKDE_2022_3231780 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1041-4347&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1041-4347&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1041-4347&client=summon |