[HTML][HTML] Wavelet trees for all

G Navarro - Journal of Discrete Algorithms, 2014 - Elsevier
The wavelet tree is a versatile data structure that serves a number of purposes, from string
processing to computational geometry. It can be regarded as a device that represents a …

Storage and retrieval of highly repetitive sequence collections

V Mäkinen, G Navarro, J Sirén… - Journal of Computational …, 2010 - liebertpub.com
A repetitive sequence collection is a set of sequences which are small variations of each
other. A prominent example are genome sequences of individuals of the same or close …

Practical rank/select queries over arbitrary sequences

F Claude, G Navarro - International Symposium on String Processing and …, 2008 - Springer
We present a practical study on the compact representation of sequences supporting rank,
select, and access queries. While there are several theoretical solutions to the problem, only …

New algorithms on wavelet trees and applications to information retrieval

T Gagie, G Navarro, SJ Puglisi - Theoretical Computer Science, 2012 - Elsevier
Wavelet trees are widely used in the representation of sequences, permutations, text
collections, binary relations, discrete points, and other succinct data structures. We show …

The wavelet matrix: An efficient wavelet tree for large alphabets

F Claude, G Navarro, A Ordónez - Information Systems, 2015 - Elsevier
The wavelet tree is a flexible data structure that permits representing sequences S [1, n] of
symbols over an alphabet of size σ, within compressed space and supporting a wide range …

[HTML][HTML] Fast relative Lempel–Ziv self-index for similar sequences

HH Do, J Jansson, K Sadakane, WK Sung - Theoretical Computer Science, 2014 - Elsevier
Recent advances in biotechnology and web technology are continuously generating huge
collections of similar strings. People now face the problem of storing them compactly while …

The compressed permuterm index

P Ferragina, R Venturini - ACM Transactions on Algorithms (TALG), 2010 - dl.acm.org
The Permuterm index [Garfield 1976] is a time-efficient and elegant solution to the string
dictionary problem in which pattern queries may possibly include one wild-card symbol …

Colored range queries and document retrieval

T Gagie, J Kärkkäinen, G Navarro, SJ Puglisi - Theoretical Computer …, 2013 - Elsevier
Colored range queries are a well-studied topic in computational geometry and database
research that, in the past decade, have found exciting applications in information retrieval. In …

The myriad virtues of wavelet trees

P Ferragina, R Giancarlo, G Manzini - Information and Computation, 2009 - Elsevier
Wavelet Trees have been introduced by Grossi et al. in SODA 2003 and have been rapidly
recognized as a very flexible tool for the design of compressed full-text indexes and data …

[HTML][HTML] Fixed block compression boosting in FM-indexes: Theory and practice

S Gog, J Kärkkäinen, D Kempa, M Petri, SJ Puglisi - Algorithmica, 2019 - Springer
The FM index (Ferragina and Manzini in J ACM 52 (4): 552–581, 2005) is a widely-used
compressed data structure that stores a string T in a compressed form and also supports fast …