Search Results - Text Compression and Indexing Algorithms
-
1
Multidimensional query processing algorithm by dimension transformation
ISSN: 2045-2322Published: Springer Science and Business Media LLC 11.04.2023Published in Scientific Reports (11.04.2023)Get full text
Journal Article -
2
Frequent contiguous pattern mining over biological sequences of protein misfolded diseases
ISSN: 1471-2105Published: Springer Science and Business Media LLC 11.09.2021Published in BMC Bioinformatics (11.09.2021)Get full text
Journal Article -
3
Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform
ISSN: 1367-4803, 1367-4811, 1367-4811Published: Oxford Oxford University Press 01.06.2012Published in Bioinformatics (Oxford, England) (01.06.2012)“…Motivation: The Burrows–Wheeler transform (BWT) is the foundation of many algorithms for compression and indexing of text data, but the cost of computing…”
Get full text
Journal Article -
4
A linguistic steganography based on word indexing compression and candidate selection
ISSN: 1380-7501, 1573-7721Published: New York Springer US 01.11.2018Published in Multimedia tools and applications (01.11.2018)“… The length of the practical embedded payload can be reduced by the proposed word indexing compression algorithm(WIC…”
Get full text
Journal Article -
5
Non-overlapping indexing in BWT-runs bounded space
ISSN: 0304-3975Published: Elsevier B.V 21.11.2025Published in Theoretical computer science (21.11.2025)“…We revisit the non-overlapping indexing problem for an efficient repetition-aware solution…”
Get full text
Journal Article -
6
Lightweight algorithms for constructing and inverting the BWT of string collections
ISSN: 0304-3975, 1879-2294Published: Elsevier B.V 29.04.2013Published in Theoretical computer science (29.04.2013)“… Such a dataset can now be generated in just a few days on a single sequencing machine. Many algorithms and data structures for compression and indexing of text have the BWT…”
Get full text
Journal Article -
7
Foldcomp: a library and format for compressing and indexing large protein structure sets
ISSN: 1367-4811, 1367-4803, 1367-4811Published: England Oxford University Press 03.04.2023Published in Bioinformatics (Oxford, England) (03.04.2023)“…; these pose a challenge in terms of storage and processing. Here, we present Foldcomp, a novel lossy structure compression algorithm, and indexing system to address this challenge…”
Get full text
Journal Article -
8
CREMSA : compressed indexing of (ultra) large multiple sequence alignments
ISSN: 1367-4803, 1367-4811, 1367-4811Published: England Oxford Publishing Limited (England) 01.07.2025Published in Bioinformatics (Oxford, England) (01.07.2025)“…). Such MSAs are largely ungapped, and mostly homogeneous on a column-wise level but not at a sequential level due to local variations, hindering the performances of sequential compression algorithms…”
Get full text
Journal Article -
9
On compressing and indexing repetitive sequences
ISSN: 0304-3975, 1879-2294Published: Elsevier B.V 29.04.2013Published in Theoretical computer science (29.04.2013)“…We introduce LZ-End, a new member of the Lempel–Ziv family of text compressors, which achieves compression ratios close to those of LZ77 but is much faster at extracting arbitrary text substrings…”
Get full text
Journal Article -
10
Stronger Lempel-Ziv Based Compressed Text Indexing
ISSN: 0178-4617, 1432-0541Published: New York Springer-Verlag 01.02.2012Published in Algorithmica (01.02.2012)“…Given a text T [1.. u ] over an alphabet of size σ , the full-text search problem consists in finding the occ occurrences of a given pattern P [1.. m…”
Get full text
Journal Article -
11
Indexing k -mers in linear space for quality value compression
ISSN: 1757-6334, 1757-6334Published: Singapore 01.10.2019Published in Journal of bioinformatics and computational biology (01.10.2019)“…Many bioinformatics tools heavily rely on -mer dictionaries to describe the composition of sequences and allow for faster reference-free algorithms or look-ups…”
Get more information
Journal Article -
12
Efficient Compression and Indexing for Highly Repetitive DNA Sequence Collections
ISSN: 1545-5963, 1557-9964, 1557-9964Published: United States IEEE 01.11.2021Published in IEEE/ACM transactions on computational biology and bioinformatics (01.11.2021)“…In this paper, we focus upon the important problem of indexing and searching highly repetitive DNA sequence collections…”
Get full text
Journal Article -
13
Fixed Block Compression Boosting in FM-Indexes: Theory and Practice
ISSN: 0178-4617, 1432-0541Published: New York Springer US 01.04.2019Published in Algorithmica (01.04.2019)“… Our main theoretical result is a new technique called fixed block compression boosting , which is a simpler and faster alternative to optimal compression boosting and implicit compression boosting…”
Get full text
Journal Article -
14
Resolution of the Burrows-Wheeler Transform Conjecture
ISSN: 2575-8454Published: IEEE 01.11.2020Published in Proceedings / annual Symposium on Foundations of Computer Science (01.11.2020)“… This is in contrast to nearly all other known compression methods, whose sizes have been shown to be either always within a \text{polylog}n factor…”
Get full text
Conference Proceeding -
15
Collapsing the Hierarchy of Compressed Data Structures: Suffix Arrays in Optimal Compressed Space
ISSN: 2575-8454Published: IEEE 06.11.2023Published in Proceedings / annual Symposium on Foundations of Computer Science (06.11.2023)“…The last two decades have witnessed a dramatic increase in the amount of highly repetitive datasets consisting of sequential data (strings, texts…”
Get full text
Conference Proceeding -
16
A new class of string transformations for compressed text indexing
ISSN: 0890-5401Published: Elsevier Inc 01.10.2023Published in Information and computation (01.10.2023)“…Introduced about thirty years ago in the field of data compression, the Burrows-Wheeler Transform (BWT…”
Get full text
Journal Article -
17
Geometric BWT: Compressed Text Indexing via Sparse Suffixes and Range Searching
ISSN: 0178-4617, 1432-0541Published: Boston Springer US 01.02.2015Published in Algorithmica (01.02.2015)“… This allows us to apply the lower bounds known in the field of orthogonal range searching to the problems in compressed text indexing…”
Get full text
Journal Article -
18
On-Demand Indexing for Referential Compression of DNA Sequences
ISSN: 1932-6203, 1932-6203Published: United States Public Library of Science 06.07.2015Published in PloS one (06.07.2015)“… Referential compression is one of these techniques, in which the similarity between the DNA of organisms of the same or an evolutionary close species is exploited to reduce the storage demands…”
Get full text
Journal Article -
19
Indexing labeled sequences
ISSN: 2376-5992, 2376-5992Published: United States PeerJ. Ltd 26.03.2018Published in PeerJ. Computer science (26.03.2018)“…are a way to add some information on a text, such as functional annotations such as genes on a DNA sequences. V(D…”
Get full text
Journal Article -
20
Compressed indexing and local alignment of DNA
ISSN: 1367-4803, 1367-4811, 1460-2059, 1367-4811Published: Oxford Oxford University Press 15.03.2008Published in Bioinformatics (15.03.2008)“…). Without indexing, one can use dynamic programming to find all the local alignments between a text T and a pattern P in O(|T||P…”
Get full text
Journal Article