A tutorial on statistically sound pattern discovery

W Hämäläinen, GI Webb - Data Mining and Knowledge Discovery, 2019 - Springer
Statistically sound pattern discovery harnesses the rigour of statistical hypothesis testing to
overcome many of the issues that have hampered standard data mining approaches to …

Significance-based discriminative sequential pattern mining

Z He, S Zhang, J Wu - Expert Systems with Applications, 2019 - Elsevier
Discriminative sequential patterns are sub-sequences whose occurrences exhibit significant
differences across sequential data sets with different class labels. The discovery of such …

Efficient mining of the most significant patterns with permutation testing

L Pellegrina, F Vandin - Proceedings of the 24th ACM SIGKDD …, 2018 - dl.acm.org
The extraction of patterns displaying significant association with a class label is a key data
mining task with wide application in many domains. We study a variant of the problem that …

Selective inference for sparse high-order interaction models

S Suzumura, K Nakagawa, Y Umezu… - International …, 2017 - proceedings.mlr.press
Finding statistically significant high-order interactions in predictive modeling is important but
challenging task because the possible number of high-order interactions is extremely large …

SPuManTE: Significant pattern mining with unconditional testing

L Pellegrina, M Riondato, F Vandin - Proceedings of the 25th ACM …, 2019 - dl.acm.org
We present SPuManTE, an efficient algorithm for mining significant patterns from a
transactional dataset. SPuManTE controls the Family-wise Error Rate: it ensures that the …

Efficient feature selection using shrinkage estimators

K Sechidis, L Azzimonti, A Pocock, G Corani… - Machine Learning, 2019 - Springer
Abstract Information theoretic feature selection methods quantify the importance of each
feature by estimating mutual information terms to capture: the relevancy, the redundancy …

Discovering significant patterns under sequential false discovery control

S Dalleiger, J Vreeken - Proceedings of the 28th ACM SIGKDD …, 2022 - dl.acm.org
We are interested in discovering those patterns from data with an empirical frequency that is
significantly differently than expected. To avoid spurious results, yet achieve high statistical …

Finding interpretable class-specific patterns through efficient neural search

NP Walter, J Fischer, J Vreeken - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Discovering patterns in data that best describe the differences between classes allows to
hypothesize and reason about class-specific mechanisms. In molecular biology, for …

Permutation strategies for mining significant sequential patterns

A Tonon, F Vandin - 2019 IEEE International Conference on …, 2019 - ieeexplore.ieee.org
The identification of significant patterns, defined as patterns whose frequency significantly
deviates from what is expected under a suitable null model of the data, is a key data mining …

Association mapping in biomedical time series via statistically significant shapelet mining

C Bock, T Gumbsch, M Moor, B Rieck… - …, 2018 - academic.oup.com
Motivation Most modern intensive care units record the physiological and vital signs of
patients. These data can be used to extract signatures, commonly known as biomarkers, that …