A tutorial on statistically sound pattern discovery
W Hämäläinen, GI Webb - Data Mining and Knowledge Discovery, 2019 - Springer
Statistically sound pattern discovery harnesses the rigour of statistical hypothesis testing to
overcome many of the issues that have hampered standard data mining approaches to …
overcome many of the issues that have hampered standard data mining approaches to …
Significance-based discriminative sequential pattern mining
Z He, S Zhang, J Wu - Expert Systems with Applications, 2019 - Elsevier
Discriminative sequential patterns are sub-sequences whose occurrences exhibit significant
differences across sequential data sets with different class labels. The discovery of such …
differences across sequential data sets with different class labels. The discovery of such …
Efficient mining of the most significant patterns with permutation testing
L Pellegrina, F Vandin - Proceedings of the 24th ACM SIGKDD …, 2018 - dl.acm.org
The extraction of patterns displaying significant association with a class label is a key data
mining task with wide application in many domains. We study a variant of the problem that …
mining task with wide application in many domains. We study a variant of the problem that …
Selective inference for sparse high-order interaction models
S Suzumura, K Nakagawa, Y Umezu… - International …, 2017 - proceedings.mlr.press
Finding statistically significant high-order interactions in predictive modeling is important but
challenging task because the possible number of high-order interactions is extremely large …
challenging task because the possible number of high-order interactions is extremely large …
SPuManTE: Significant pattern mining with unconditional testing
We present SPuManTE, an efficient algorithm for mining significant patterns from a
transactional dataset. SPuManTE controls the Family-wise Error Rate: it ensures that the …
transactional dataset. SPuManTE controls the Family-wise Error Rate: it ensures that the …
Efficient feature selection using shrinkage estimators
Abstract Information theoretic feature selection methods quantify the importance of each
feature by estimating mutual information terms to capture: the relevancy, the redundancy …
feature by estimating mutual information terms to capture: the relevancy, the redundancy …
Discovering significant patterns under sequential false discovery control
S Dalleiger, J Vreeken - Proceedings of the 28th ACM SIGKDD …, 2022 - dl.acm.org
We are interested in discovering those patterns from data with an empirical frequency that is
significantly differently than expected. To avoid spurious results, yet achieve high statistical …
significantly differently than expected. To avoid spurious results, yet achieve high statistical …
Finding interpretable class-specific patterns through efficient neural search
Discovering patterns in data that best describe the differences between classes allows to
hypothesize and reason about class-specific mechanisms. In molecular biology, for …
hypothesize and reason about class-specific mechanisms. In molecular biology, for …
Permutation strategies for mining significant sequential patterns
The identification of significant patterns, defined as patterns whose frequency significantly
deviates from what is expected under a suitable null model of the data, is a key data mining …
deviates from what is expected under a suitable null model of the data, is a key data mining …
Association mapping in biomedical time series via statistically significant shapelet mining
Motivation Most modern intensive care units record the physiological and vital signs of
patients. These data can be used to extract signatures, commonly known as biomarkers, that …
patients. These data can be used to extract signatures, commonly known as biomarkers, that …