A guided tour to approximate string matching
G Navarro - ACM computing surveys (CSUR), 2001 - dl.acm.org
We survey the current techniques to cope with the problem of string matching that allows
errors. This is becoming a more and more relevant issue for many fast growing areas such …
errors. This is becoming a more and more relevant issue for many fast growing areas such …
Discovery of frequent episodes in event sequences
H Mannila, H Toivonen, A Inkeri Verkamo - Data mining and knowledge …, 1997 - Springer
Sequences of events describing the behavior and actions of users or systems can be
collected in several domains. An episode is a collection of events that occur relatively close …
collected in several domains. An episode is a collection of events that occur relatively close …
Mining minimal distinguishing subsequence patterns with gap constraints
Discovering contrasts between collections of data is an important task in data mining. In this
paper, we introduce a new type of contrast pattern, called a Minimal Distinguishing …
paper, we introduce a new type of contrast pattern, called a Minimal Distinguishing …
Reliable detection of episodes in event sequences
R Gwadera, MJ Atallah, W Szpankowski - Knowledge and Information …, 2005 - Springer
Suppose one wants to detect bad or suspicious subsequences in event sequences. Whether
an observed pattern of activity (in the form of a particular subsequence) is significant and …
an observed pattern of activity (in the form of a particular subsequence) is significant and …
Accelerating edit-distance sequence alignment on GPU using the wavefront algorithm
Sequence alignment remains a fundamental problem with practical applications ranging
from pattern recognition to computational biology. Traditional algorithms based on dynamic …
from pattern recognition to computational biology. Traditional algorithms based on dynamic …
Rule discovery in telecommunication alarm data
Fault management is an important but difficultarea of telecommunication network
management: networksproduce large amounts of alarm information which must beanalyzed …
management: networksproduce large amounts of alarm information which must beanalyzed …
Semi-local string comparison: Algorithmic techniques and applications
A Tiskin - Mathematics in Computer Science, 2008 - Springer
Given two strings, the longest common subsequence (LCS) problem consists in computing
the length of the longest string that is a subsequence of both input strings. Its generalisation …
the length of the longest string that is a subsequence of both input strings. Its generalisation …
Summarizing sequential data with closed partial orders
G Casas-Garriga - Proceedings of the 2005 SIAM International …, 2005 - SIAM
In this paper we address the task of summarizing a set of input sequences by means of local
ordering relationships on items occurring in the sequences. Our goal is not mining these …
ordering relationships on items occurring in the sequences. Our goal is not mining these …
Fine-grained complexity of analyzing compressed data: Quantifying improvements over decompress-and-solve
Can we analyze data without decompressing it? As our data keeps growing, understanding
the time complexity of problems on compressed inputs, rather than in convenient …
the time complexity of problems on compressed inputs, rather than in convenient …
A unified view of the apriori-based algorithms for frequent episode discovery
Frequent episode discovery framework is a popular framework in temporal data mining with
many applications. Over the years, many different notions of frequencies of episodes have …
many applications. Over the years, many different notions of frequencies of episodes have …