Queries on LZ-bounded encodings

G Navarro - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

Two decades ago, a breakthrough in indexing string collections made it possible to
represent them within their compressed space while at the same time offering indexed …

被引用次数：117 相关文章所有 7 个版本

[PDF] arxiv.org

Fully functional suffix trees and optimal text searching in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Journal of the ACM (JACM), 2020 - dl.acm.org

Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

被引用次数：193 相关文章所有 12 个版本

[PDF] arxiv.org

At the roots of dictionary compression: string attractors

D Kempa, N Prezza - Proceedings of the 50th Annual ACM SIGACT …, 2018 - dl.acm.org

A well-known fact in the field of lossless text compression is that high-order entropy is a
weak model when the input contains long repetitions. Motivated by this fact, decades of …

被引用次数：152 相关文章所有 17 个版本

[PDF] acm.org Full View

Resolution of the burrows-wheeler transform conjecture

D Kempa, T Kociumaka - Communications of the ACM, 2022 - dl.acm.org

Abstract The Burrows-Wheeler Transform (BWT) is an invertible text transformation that
permutes symbols of a text according to the lexicographical order of its suffixes. BWT is the …

被引用次数：92 相关文章所有 10 个版本

[PDF] siam.org

Optimal-time text indexing in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Proceedings of the Twenty-Ninth Annual ACM …, 2018 - SIAM

Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

被引用次数：130 相关文章所有 13 个版本

[PDF] unive.it

Towards a definitive measure of repetitiveness

T Kociumaka, G Navarro, N Prezza - Latin American Symposium on …, 2020 - Springer

Unlike in statistical compression, where Shannon's entropy is a definitive lower bound, no
such clear measure exists for the compressibility of repetitive sequences. Since statistical …

被引用次数：61 相关文章所有 5 个版本

[PDF] arxiv.org

Dynamic suffix array with polylogarithmic queries and updates

D Kempa, T Kociumaka - Proceedings of the 54th Annual ACM SIGACT …, 2022 - dl.acm.org

The suffix array SA [1.. n] of a text T of length n is a permutation of {1,…, n} describing the
lexicographical ordering of suffixes of T and is considered to be one of the most important …

被引用次数：28 相关文章所有 4 个版本

[PDF] siam.org

An upper bound and linear-space queries on the LZ-End parsing

D Kempa, B Saha - Proceedings of the 2022 Annual ACM-SIAM …, 2022 - SIAM

Lempel–Ziv (LZ77) compression is the most commonly used lossless compression
algorithm. The basic idea is to greedily break the input string into blocks (called “phrases”) …

被引用次数：22 相关文章所有 6 个版本

[HTML] sciencedirect.com

[HTML][HTML] Sensitivity of string compressors and repetitiveness measures

T Akagi, M Funakoshi, S Inenaga - Information and Computation, 2023 - Elsevier

The sensitivity of a string compression algorithm C asks how much the output size C (T) for
an input string T can increase when a single character edit operation is performed on T. This …

被引用次数：26 相关文章所有 6 个版本

[HTML] sciencedirect.com

[HTML][HTML] Universal compressed text indexing

G Navarro, N Prezza - Theoretical Computer Science, 2019 - Elsevier

The rise of repetitive datasets has lately generated a lot of interest in compressed self-
indexes based on dictionary compression, a rich and heterogeneous family of techniques …

被引用次数：55 相关文章所有 12 个版本

Indexing highly repetitive string collections, part II: Compressed indexes

Fully functional suffix trees and optimal text searching in BWT-runs bounded space

At the roots of dictionary compression: string attractors

Resolution of the burrows-wheeler transform conjecture

Optimal-time text indexing in BWT-runs bounded space

Towards a definitive measure of repetitiveness

Dynamic suffix array with polylogarithmic queries and updates

An upper bound and linear-space queries on the LZ-End parsing

[HTML][HTML] Sensitivity of string compressors and repetitiveness measures

[HTML][HTML] Universal compressed text indexing

高级搜索

引用