A compressed enhanced suffix array supporting fast string matching

M Vyverman, B De Baets, V Fack… - Nucleic acids …, 2012 - academic.oup.com

The combination of incessant advances in sequencing technology producing large amounts
of data and innovative bioinformatics approaches, designed to cope with this data flood, has …

被引用次数：38 相关文章所有 15 个版本

[PDF] uchile.cl

Fully compressed suffix trees

LMS Russo, G Navarro, AL Oliveira - ACM transactions on algorithms …, 2011 - dl.acm.org

Suffix trees are by far the most important data structure in stringology, with a myriad of
applications in fields like bioinformatics and information retrieval. Classical representations …

被引用次数：81 相关文章

[PDF] uni-ulm.de

Cst++

E Ohlebusch, J Fischer, S Gog - … SPIRE 2010, Los Cabos, Mexico, October …, 2010 - Springer

Let A be an array of n elements taken from a totally ordered set. We present a data structure
of size 3 n+ o (n) bits that allows us to answer the following queries on A in constant time …

被引用次数：79 相关文章所有 7 个版本

[PDF] psu.edu

Computing matching statistics and maximal exact matches on compressed full-text indexes

E Ohlebusch, S Gog, A Kügel - … , SPIRE 2010, Los Cabos, Mexico, October …, 2010 - Springer

Exact string matching is a problem that computer programmers face on a regular basis, and
full-text indexes like the suffix tree or the suffix array provide fast string search over large …

被引用次数：66 相关文章所有 11 个版本

[PDF] academia.edu

Inverted indexes for phrases and strings

M Patil, SV Thankachan, R Shah, WK Hon… - Proceedings of the 34th …, 2011 - dl.acm.org

Inverted indexes are the most fundamental and widely used data structures in information
retrieval. For each unique word occurring in a document collection, the inverted index stores …

被引用次数：64 相关文章所有 11 个版本

[PDF] sciencedirect.com

Bidirectional search in a string with wavelet trees and bidirectional matching statistics

T Schnattinger, E Ohlebusch, S Gog - Information and Computation, 2012 - Elsevier

Searching for genes encoding microRNAs (miRNAs) is an important task in genome
analysis. Because the secondary structure of miRNA (but not the sequence) is highly …

被引用次数：45 相关文章所有 6 个版本

[PDF] sciencedirect.com

Combined data structure for previous-and next-smaller-values

J Fischer - Theoretical Computer Science, 2011 - Elsevier

Let A be a static array storing n elements from a totally ordered set. We present a data
structure of optimal size at most nlog2 (3+ 22)+ o (n) bits that allows us to answer the …

被引用次数：41 相关文章所有 11 个版本

[PDF] uni-ulm.de

Compressed suffix trees: design, construction, and applications

S Gog - 2011 - oparu.uni-ulm.de

In sequence analysis it is often advantageous to build an index data structure for large texts,
as many tasks--for instance repeated pattern matching--can then be solved in optimal time …

被引用次数：34 相关文章所有 3 个版本

Compressed suffix trees: Efficient computation and storage of LCP-values

S Gog, E Ohlebusch - Journal of Experimental Algorithmics (JEA), 2013 - dl.acm.org

The suffix tree is a very important data structure in string processing, but typical
implementations suffer from huge space consumption. In large-scale applications …

被引用次数：26 相关文章

[PDF] springer.com

Bitpacking techniques for indexing genomes: II. Enhanced suffix arrays

TD Wu - Algorithms for Molecular Biology, 2016 - Springer

Background Suffix arrays and their variants are used widely for representing genomes in
search applications. Enhanced suffix arrays (ESAs) provide fast search speed, but require …

被引用次数：18 相关文章所有 23 个版本