Cst++

E Ohlebusch, J Fischer, S Gog - … SPIRE 2010, Los Cabos, Mexico, October …, 2010 - Springer
Let A be an array of n elements taken from a totally ordered set. We present a data structure
of size 3 n+ o (n) bits that allows us to answer the following queries on A in constant time …

[HTML][HTML] Computing the longest common prefix array based on the Burrows–Wheeler transform

T Beller, S Gog, E Ohlebusch, T Schnattinger - Journal of Discrete …, 2013 - Elsevier
Many sequence analysis tasks can be accomplished with a suffix array, and several of them
additionally need the longest common prefix array. In large scale applications, suffix arrays …

[图书][B] Shared-memory parallelism can be simple, fast, and scalable

J Shun - 2017 - books.google.com
Parallelism is the key to achieving high performance in computing. However, writing efficient
and scalable parallel programs is notoriously difficult, and often requires significant …

Parallel wavelet tree construction

J Shun - 2015 Data compression conference, 2015 - ieeexplore.ieee.org
We present parallel algorithms for wavelet tree construction with polylogarithmic depth,
improving upon the linear depth of the recent parallel algorithms by Fuentes-Sepulveda et …

Fast parallel computation of longest common prefixes

J Shun - SC'14: Proceedings of the International Conference …, 2014 - ieeexplore.ieee.org
Suffix arrays and the corresponding longest common prefix (LCP) array have wide
applications in bioinformatics, information retrieval and data compression. In this work, we …

[HTML][HTML] Tighter bounds for the sum of irreducible LCP values

J Kärkkäinen, D Kempa, M Piatkowski - Theoretical Computer Science, 2016 - Elsevier
The suffix array is frequently augmented with the longest-common-prefix (LCP) array that
stores the lengths of the longest common prefixes between lexicographically adjacent …

Efficient maximal repeat finding using the burrows-wheeler transform and wavelet tree

MO Kulekci, JS Vitter, B Xu - IEEE/ACM Transactions on …, 2011 - ieeexplore.ieee.org
Finding repetitive structures in genomes and proteins is important to understand their
biological functions. Many data compressors for modern genomic sequences rely heavily on …

LCP array construction in external memory

J Kärkkäinen, D Kempa - Journal of Experimental Algorithmics (JEA), 2016 - dl.acm.org
One of the most important data structures for string processing—the suffix array—needs to
be augmented with the longest-common-prefix (LCP) array in numerous applications. We …

slaMEM: efficient retrieval of maximal exact matches using a sampled LCP array

F Fernandes, AT Freitas - Bioinformatics, 2014 - academic.oup.com
Motivation: Maximal exact matches, or just MEMs, are a powerful tool in the context of
multiple sequence alignment and approximate string matching. The most efficient algorithms …

Compressed suffix trees: Efficient computation and storage of LCP-values

S Gog, E Ohlebusch - Journal of Experimental Algorithmics (JEA), 2013 - dl.acm.org
The suffix tree is a very important data structure in string processing, but typical
implementations suffer from huge space consumption. In large-scale applications …