Sublinear time Lempel-Ziv (LZ77) factorization

J Ellert - International Symposium on String Processing and …, 2023 - Springer
Abstract The Lempel-Ziv (LZ77) factorization of a string is a widely-used algorithmic tool that
plays a central role in data compression and indexing. For a length-n string over integer …

[HTML][HTML] On parsing optimality for dictionary-based text compression—the Zip case

A Langiu - Journal of Discrete Algorithms, 2013 - Elsevier
Dictionary-based compression schemes are the most commonly used data compression
schemes since they appeared in the foundational paper of Ziv and Lempel in 1977, and …

[HTML][HTML] Note on the greedy parsing optimality for dictionary-based text compression

M Crochemore, A Langiu, F Mignosi - Theoretical Computer Science, 2014 - Elsevier
Dynamic dictionary-based compression schemes are the most daily used data compression
schemes since they appeared in the foundational paper of Ziv and Lempel in 1977 …

[PDF][PDF] Efficient string algorithmics across alphabet realms

J Ellert - 2024 - eldorado.tu-dortmund.de
Stringology is a subfield of computer science dedicated to analyzing and processing
sequences of symbols. It plays a crucial role in various applications, including lossless …

The rightmost equal-cost position problem

M Crochemore, A Langiu… - 2013 Data Compression …, 2013 - ieeexplore.ieee.org
LZ77-based compression schemes compress the input text by replacing factors in the text
with an encoded reference to a previous occurrence formed by the couple (length, offset) …

[HTML][HTML] Dictionary-symbolwise flexible parsing

M Crochemore, L Giambruno, A Langiu… - Journal of Discrete …, 2012 - Elsevier
Linear-time optimal parsing algorithms are rare in the dictionary-based branch of the data
compression theory. A recent result is the Flexible Parsing algorithm of Matias and Sahinalp …

[HTML][HTML] The greedy approach to dictionary-based static text compression on a distributed system

S De Agostino - Journal of Discrete Algorithms, 2015 - Elsevier
The greedy approach to dictionary-based static text compression can be executed by a finite-
state machine. When it is applied in parallel to different blocks of data independently, there …

[PDF][PDF] A Worst Case Analysis of the LZ2 Compression Algorithm with Bounded Size Dictionaries.

S De Agostino - PSC, 2023 - stringology.org
We make a worst case analysis of practical implementations of LZ2 compression, where the
work space remains constant with the increase of the data size and the optimal solution must …

Relations between greedy and bit-optimal LZ77 encodings

D Kosolobov - arXiv preprint arXiv:1707.09789, 2017 - arxiv.org
This paper investigates the size in bits of the LZ77 encoding, which is the most popular and
efficient variant of the Lempel-Ziv encodings used in data compression. We prove that, for a …

Greedy versus optimal analysis of bounded size dictionary compression and on-the-fly distributed computing

S De Agostino - Discrete Applied Mathematics, 2024 - Elsevier
Scalability and robustness are not an issue when compression is applied for massive data
storage, in the context of distributed computing. Speeding up on-the-fly compression for data …