Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences
G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …
[HTML][HTML] Wavelet trees for all
G Navarro - Journal of Discrete Algorithms, 2014 - Elsevier
The wavelet tree is a versatile data structure that serves a number of purposes, from string
processing to computational geometry. It can be regarded as a device that represents a …
processing to computational geometry. It can be regarded as a device that represents a …
New algorithms on wavelet trees and applications to information retrieval
Wavelet trees are widely used in the representation of sequences, permutations, text
collections, binary relations, discrete points, and other succinct data structures. We show …
collections, binary relations, discrete points, and other succinct data structures. We show …
The wavelet matrix: An efficient wavelet tree for large alphabets
The wavelet tree is a flexible data structure that permits representing sequences S [1, n] of
symbols over an alphabet of size σ, within compressed space and supporting a wide range …
symbols over an alphabet of size σ, within compressed space and supporting a wide range …
Improved grammar-based compressed indexes
We introduce the first grammar-compressed representation of a sequence that supports
searches in time that depends only logarithmically on the size of the grammar. Given a text T …
searches in time that depends only logarithmically on the size of the grammar. Given a text T …
Columnar storage and list-based processing for graph database management systems
We revisit column-oriented storage and query processing techniques in the context of
contemporary graph database management systems (GDBMSs). Similar to column-oriented …
contemporary graph database management systems (GDBMSs). Similar to column-oriented …
[HTML][HTML] Fast relative Lempel–Ziv self-index for similar sequences
Recent advances in biotechnology and web technology are continuously generating huge
collections of similar strings. People now face the problem of storing them compactly while …
collections of similar strings. People now face the problem of storing them compactly while …
Wavelet trees: A survey
C Makris - Computer Science and Information Systems, 2012 - doiserbia.nb.rs
The topic of this paper is the exploration of the various characteristics of the wavelet tree
data structure, a data structure that was initially proposed for text compression applications …
data structure, a data structure that was initially proposed for text compression applications …
Space-efficient data-analysis queries on grids
We consider various data-analysis queries on two-dimensional points. We give new
space/time tradeoffs over previous work on geometric queries such as dominance and …
space/time tradeoffs over previous work on geometric queries such as dominance and …