Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences

G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …

[HTML][HTML] Wavelet trees for all

G Navarro - Journal of Discrete Algorithms, 2014 - Elsevier
The wavelet tree is a versatile data structure that serves a number of purposes, from string
processing to computational geometry. It can be regarded as a device that represents a …

New algorithms on wavelet trees and applications to information retrieval

T Gagie, G Navarro, SJ Puglisi - Theoretical Computer Science, 2012 - Elsevier
Wavelet trees are widely used in the representation of sequences, permutations, text
collections, binary relations, discrete points, and other succinct data structures. We show …

The wavelet matrix: An efficient wavelet tree for large alphabets

F Claude, G Navarro, A Ordónez - Information Systems, 2015 - Elsevier
The wavelet tree is a flexible data structure that permits representing sequences S [1, n] of
symbols over an alphabet of size σ, within compressed space and supporting a wide range …

Improved grammar-based compressed indexes

F Claude, G Navarro - International Symposium on String Processing and …, 2012 - Springer
We introduce the first grammar-compressed representation of a sequence that supports
searches in time that depends only logarithmically on the size of the grammar. Given a text T …

Optimal dynamic sequence representations

G Navarro, Y Nekrich - SIAM Journal on Computing, 2014 - SIAM
We describe a data structure that supports access, rank, and select queries, as well as
symbol insertions and deletions, on a string S 1, n over alphabet 1..σ in time …

Columnar storage and list-based processing for graph database management systems

P Gupta, A Mhedhbi, S Salihoglu - arXiv preprint arXiv:2103.02284, 2021 - arxiv.org
We revisit column-oriented storage and query processing techniques in the context of
contemporary graph database management systems (GDBMSs). Similar to column-oriented …

[HTML][HTML] Fast relative Lempel–Ziv self-index for similar sequences

HH Do, J Jansson, K Sadakane, WK Sung - Theoretical Computer Science, 2014 - Elsevier
Recent advances in biotechnology and web technology are continuously generating huge
collections of similar strings. People now face the problem of storing them compactly while …

Wavelet trees: A survey

C Makris - Computer Science and Information Systems, 2012 - doiserbia.nb.rs
The topic of this paper is the exploration of the various characteristics of the wavelet tree
data structure, a data structure that was initially proposed for text compression applications …

Space-efficient data-analysis queries on grids

G Navarro, Y Nekrich, LMS Russo - Theoretical Computer Science, 2013 - Elsevier
We consider various data-analysis queries on two-dimensional points. We give new
space/time tradeoffs over previous work on geometric queries such as dominance and …