Optimal square detection over general alphabets
Squares (fragments of the form xx, for some string x) are arguably the most natural type of
repetition in strings. The basic algorithmic question concerning squares is to check if a given …
repetition in strings. The basic algorithmic question concerning squares is to check if a given …
[HTML][HTML] Approximate cover of strings
Regularities in strings arise in various areas of science, including coding and automata
theory, formal language theory, combinatorics, molecular biology and many others. A …
theory, formal language theory, combinatorics, molecular biology and many others. A …
Can we recover the cover?
Data analysis typically involves error recovery and detection of regularities as two different
key tasks. In this paper we show that there are data types for which these two tasks can be …
key tasks. In this paper we show that there are data types for which these two tasks can be …
Streaming periodicity with mismatches
We study the problem of finding all $ k $-periods of a length-$ n $ string $ S $, presented as
a data stream. $ S $ is said to have $ k $-period $ p $ if its prefix of length $ np $ differs from …
a data stream. $ S $ is said to have $ k $-period $ p $ if its prefix of length $ np $ differs from …
On approximating string selection problems with outliers
Many problems in bioinformatics are about finding strings that approximately represent a
collection of given strings. We look at more general problems where some input strings can …
collection of given strings. We look at more general problems where some input strings can …
Periodicity in data streams with wildcards
We investigate the problem of detecting periodic trends within a string S of length n, arriving
in the streaming model, containing at most k wildcard characters, where k= o (n). A wildcard …
in the streaming model, containing at most k wildcard characters, where k= o (n). A wildcard …
[PDF][PDF] Efficient string algorithmics across alphabet realms
J Ellert - 2024 - eldorado.tu-dortmund.de
Stringology is a subfield of computer science dedicated to analyzing and processing
sequences of symbols. It plays a crucial role in various applications, including lossless …
sequences of symbols. It plays a crucial role in various applications, including lossless …
MBPD: Motif-based period detection
Massive amounts of data are generated daily at a rapid rate. As a result, the world is faced
with unprecedented challenges and opportunities on managing the ever-growing data …
with unprecedented challenges and opportunities on managing the ever-growing data …
A comprehensive study on periodicity mining algorithms
Mining knowledge from time series database is always a challenging task due to inherent
complexity. Periodicity mining is one of the methods for analysing time series data in order to …
complexity. Periodicity mining is one of the methods for analysing time series data in order to …
Quasi-periodicity under mismatch errors
Tracing regularities plays a key role in data analysis for various areas of science, including
coding and automata theory, formal language theory, combinatorics, molecular biology and …
coding and automata theory, formal language theory, combinatorics, molecular biology and …