Search results - Text Compression and Indexing Algorithms

  1. 1
  2. 2
  3. 3

    Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform by Cox, Anthony J., Bauer, Markus J., Jakobi, Tobias, Rosone, Giovanna

    ISSN: 1367-4803, 1367-4811
    Published: Oxford Oxford University Press 01.06.2012
    Published in Bioinformatics (Oxford, England) (01.06.2012)
    “…Motivation: The Burrows–Wheeler transform (BWT) is the foundation of many algorithms for compression and indexing of text data, but the cost of computing…”
    Journal Article
  4. 4

    A linguistic steganography based on word indexing compression and candidate selection by Xiang, Lingyun, Wu, Wenshuai, Li, Xu, Yang, Chunfang

    ISSN: 1380-7501, 1573-7721
    Published: New York Springer US 01.11.2018
    Published in Multimedia tools and applications (01.11.2018)
    “… The length of the practical embedded payload can be reduced by the proposed word indexing compression algorithm(WIC…”
    Journal Article
  5. 5

    Non-overlapping indexing in BWT-runs bounded space by Gibney, Daniel, MacNichol, Paul, Thankachan, Sharma V.

    ISSN: 0304-3975
    Published: Elsevier B.V 21.11.2025
    Published in Theoretical computer science (21.11.2025)
    “…We revisit the non-overlapping indexing problem for an efficient repetition-aware solution…”
    Journal Article
  6. 6

    Lightweight algorithms for constructing and inverting the BWT of string collections by Bauer, Markus J., Cox, Anthony J., Rosone, Giovanna

    ISSN: 0304-3975, 1879-2294
    Published: Elsevier B.V 29.04.2013
    Published in Theoretical computer science (29.04.2013)
    “… Such a dataset can now be generated in just a few days on a single sequencing machine. Many algorithms and data structures for compression and indexing of text have the BWT…”
    Journal Article
  7. 7

    Foldcomp: a library and format for compressing and indexing large protein structure sets by Kim, Hyunbin, Mirdita, Milot, Steinegger, Martin

    ISSN: 1367-4811, 1367-4803
    Published: England Oxford University Press 03.04.2023
    Published in Bioinformatics (Oxford, England) (03.04.2023)
    “…; these pose a challenge in terms of storage and processing. Here, we present Foldcomp, a novel lossy structure compression algorithm, and indexing system to address this challenge…”
    Journal Article
  8. 8

    CREMSA: compressed indexing of (ultra) large multiple sequence alignments by Salson, Mikaël, Boddaert, Arthur, Gueye, Awa Bousso, Bulteau, Laurent, Hernandez--Courbevoie, Yohan, Marchet, Camille, Pan, Nan, Will, Sebastian, Ponty, Yann

    ISSN: 1367-4803, 1367-4811
    Published: England Oxford Publishing Limited (England) 01.07.2025
    Published in Bioinformatics (Oxford, England) (01.07.2025)
    “…). Such MSAs are largely ungapped, and mostly homogeneous on a column-wise level but not at a sequential level due to local variations, hindering the performances of sequential compression algorithms…”
    Journal Article
  9. 9

    On compressing and indexing repetitive sequences by Kreft, Sebastian, Navarro, Gonzalo

    ISSN: 0304-3975, 1879-2294
    Published: Elsevier B.V 29.04.2013
    Published in Theoretical computer science (29.04.2013)
    “…We introduce LZ-End, a new member of the Lempel–Ziv family of text compressors, which achieves compression ratios close to those of LZ77 but is much faster at extracting arbitrary text substrings…”
    Journal Article
  10. 10

    Stronger Lempel-Ziv Based Compressed Text Indexing by Arroyuelo, Diego, Navarro, Gonzalo, Sadakane, Kunihiko

    ISSN: 0178-4617, 1432-0541
    Published: New York Springer-Verlag 01.02.2012
    Published in Algorithmica (01.02.2012)
    “…Given a text T[1..u] over an alphabet of size σ, the full-text search problem consists in finding the occ occurrences of a given pattern P[1..m…”
    Journal Article
  11. 11

    Indexing k-mers in linear space for quality value compression by Shibuya, Yoshihiro, Comin, Matteo

    ISSN: 1757-6334
    Published: Singapore 01.10.2019
    “…Many bioinformatics tools heavily rely on k-mer dictionaries to describe the composition of sequences and allow for faster reference-free algorithms or look-ups…”
    Journal Article
  12. 12

    Efficient Compression and Indexing for Highly Repetitive DNA Sequence Collections by Huo, Hongwei, Chen, Xiaoyang, Guo, Xu, Vitter, Jeffrey Scott

    ISSN: 1545-5963, 1557-9964
    Published: United States IEEE 01.11.2021
    “…In this paper, we focus upon the important problem of indexing and searching highly repetitive DNA sequence collections…”
    Journal Article
  13. 13

    Fixed Block Compression Boosting in FM-Indexes: Theory and Practice by Gog, Simon, Kärkkäinen, Juha, Kempa, Dominik, Petri, Matthias, Puglisi, Simon J.

    ISSN: 0178-4617, 1432-0541
    Published: New York Springer US 01.04.2019
    Published in Algorithmica (01.04.2019)
    “… Our main theoretical result is a new technique called fixed block compression boosting, which is a simpler and faster alternative to optimal compression boosting and implicit compression boosting…”
    Journal Article
  14. 14

    Resolution of the Burrows-Wheeler Transform Conjecture by Kempa, Dominik, Kociumaka, Tomasz

    ISSN: 2575-8454
    Published: IEEE 01.11.2020
    “… This is in contrast to nearly all other known compression methods, whose sizes have been shown to be either always within a polylog n factor…”
    Conference Proceeding
  15. 15

    Collapsing the Hierarchy of Compressed Data Structures: Suffix Arrays in Optimal Compressed Space by Kempa, Dominik, Kociumaka, Tomasz

    ISSN: 2575-8454
    Published: IEEE 06.11.2023
    “…The last two decades have witnessed a dramatic increase in the amount of highly repetitive datasets consisting of sequential data (strings, texts…”
    Conference Proceeding
  16. 16

    A new class of string transformations for compressed text indexing by Giancarlo, Raffaele, Manzini, Giovanni, Restivo, Antonio, Rosone, Giovanna, Sciortino, Marinella

    ISSN: 0890-5401
    Published: Elsevier Inc 01.10.2023
    Published in Information and computation (01.10.2023)
    “…Introduced about thirty years ago in the field of data compression, the Burrows-Wheeler Transform (BWT…”
    Journal Article
  17. 17

    Geometric BWT: Compressed Text Indexing via Sparse Suffixes and Range Searching by Chien, Yu-Feng, Hon, Wing-Kai, Shah, Rahul, Thankachan, Sharma V., Vitter, Jeffrey Scott

    ISSN: 0178-4617, 1432-0541
    Published: Boston Springer US 01.02.2015
    Published in Algorithmica (01.02.2015)
    “… This allows us to apply the lower bounds known in the field of orthogonal range searching to the problems in compressed text indexing…”
    Journal Article
  18. 18

    On-Demand Indexing for Referential Compression of DNA Sequences by Alves, Fernando, Cogo, Vinicius, Wandelt, Sebastian, Leser, Ulf, Bessani, Alysson

    ISSN: 1932-6203
    Published: United States Public Library of Science 06.07.2015
    Published in PloS one (06.07.2015)
    “… Referential compression is one of these techniques, in which the similarity between the DNA of organisms of the same or an evolutionary close species is exploited to reduce the storage demands…”
    Journal Article
  19. 19

    Indexing labeled sequences by Rocher, Tatiana, Giraud, Mathieu, Salson, Mikaël

    ISSN: 2376-5992
    Published: United States PeerJ. Ltd 26.03.2018
    Published in PeerJ. Computer science (26.03.2018)
    “…are a way to add some information on a text, such as functional annotations such as genes on a DNA sequences. V(D…”
    Journal Article
  20. 20

    Compressed indexing and local alignment of DNA by Lam, T. W., Sung, W. K., Tam, S. L., Wong, C. K., Yiu, S. M.

    ISSN: 1367-4803, 1367-4811, 1460-2059
    Published: Oxford Oxford University Press 15.03.2008
    Published in Bioinformatics (15.03.2008)
    “…). Without indexing, one can use dynamic programming to find all the local alignments between a text T and a pattern P in O(|T||P…”
    Journal Article