Dynamic suffix array with polylogarithmic queries and updates

D Kempa, T Kociumaka - Proceedings of the 54th Annual ACM SIGACT …, 2022 - dl.acm.org
The suffix array SA [1.. n] of a text T of length n is a permutation of {1,…, n} describing the
lexicographical ordering of suffixes of T and is considered to be one of the most important …

Prospects and limitations of full-text index structures in genome analysis

M Vyverman, B De Baets, V Fack… - Nucleic acids …, 2012 - academic.oup.com
The combination of incessant advances in sequencing technology producing large amounts
of data and innovative bioinformatics approaches, designed to cope with this data flood, has …

[HTML][HTML] Dynamic index and LZ factorization in compressed space

T Nishimoto, I Tomohiro, S Inenaga, H Bannai… - Discrete Applied …, 2020 - Elsevier
In this paper, we propose a new dynamic compressed index of O (w) space for a dynamic
text T, where w= O (min (z log N log∗ M, N)) is the size of the signature encoding of T, z is …

Stream-based translation models for statistical machine translation

A Levenberg, C Callison-Burch… - … Technologies: The 2010 …, 2010 - research.ed.ac.uk
Typical statistical machine translation systems are trained with static parallel corpora. Here
we account for scenarios with a continuous incoming stream of parallel training data. Such …

[HTML][HTML] Position heaps: A simple and dynamic text indexing data structure

A Ehrenfeucht, RM McConnell, N Osheim… - Journal of Discrete …, 2011 - Elsevier
We address the problem of finding the locations of all instances of a string P in a text T,
where preprocessing of T is allowed in order to facilitate the queries. Previous data …

Compressed suffix arrays for massive data

J Sirén - International Symposium on String Processing and …, 2009 - Springer
We present a fast space-efficient algorithm for constructing compressed suffix arrays (CSA).
The algorithm requires O (n log n) time in the worst case, and only O (n) bits of extra space in …

Novel computational techniques for mapping and classification of Next-Generation Sequencing data

K Brinda - 2016 - hal.science
Since their emergence around 2006, Next-Generation Sequencing technologies have been
revolutionizing biological and medical research. Obtaining instantly an extensive amount of …

Update query time trade-off for dynamic suffix arrays

A Amir, I Boneh - arXiv preprint arXiv:2007.06604, 2020 - arxiv.org
The Suffix Array SA (S) of a string S [1... n] is an array containing all the suffixes of S sorted
by lexicographic order. The suffix array is one of the most well known indexing data …

Complex event detection in extremely resource-constrained wireless sensor networks

M Zoumboulakis, G Roussos - Mobile Networks and Applications, 2011 - Springer
Complex Events are sequences of sensor measurements indicating interesting or unusual
activity in the monitored process. Such events are ubiquitous in a wide range of Wireless …

Secure hashing-based verifiable pattern matching

F Chen, D Wang, R Li, J Chen, Z Ming… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Verifiable pattern matching is the problem of finding a given pattern verifiably from the
outsourced textual data, which is resident in an untrusted remote server. This problem has …