Output space sampling for graph patterns

M Al Hasan, MJ Zaki - Proceedings of the VLDB Endowment, 2009 - dl.acm.org
Recent interest in graph pattern mining has shifted from finding all frequent subgraphs to
obtaining a small subset of frequent subgraphs that are representative, discriminative or …

Discovering association rules from big graphs

W Fan, W Fu, R Jin, P Lu, C Tian - Proceedings of the VLDB Endowment, 2022 - dl.acm.org
This paper tackles two challenges to discovery of graph rules. Existing discovery methods
often (a) return an excessive number of rules, and (b) do not scale with large graphs given …

Mining frequent itemsets through progressive sampling with rademacher averages

M Riondato, E Upfal - Proceedings of the 21th ACM SIGKDD …, 2015 - dl.acm.org
We present an algorithm to extract an high-quality approximation of the (top-k) Frequent
itemsets (FIs) from random samples of a transactional dataset. With high probability the …

Approximate parallel high utility itemset mining

Y Chen, A An - Big data research, 2016 - Elsevier
High utility itemset mining discovers itemsets whose utility is above a given threshold, where
the utility measures the importance of an itemset. It overcomes the limitation of frequent …

Parallel rule discovery from large datasets by sampling

W Fan, Z Han, Y Wang, M Xie - … of the 2022 international conference on …, 2022 - dl.acm.org
Rule discovery from large datasets is often prohibitively costly. The problem becomes more
staggering when the rules are collectively defined across multiple tables. To scale with large …

Efficient discovery of association rules and frequent itemsets through sampling with tight performance guarantees

M Riondato, E Upfal - ACM Transactions on Knowledge Discovery from …, 2014 - dl.acm.org
The tasks of extracting (top-K) Frequent Itemsets (FIs) and Association Rules (ARs) are
fundamental primitives in data mining and database applications. Exact algorithms for these …

MCRapper: Monte-Carlo Rademacher averages for poset families and approximate pattern mining

L Pellegrina, C Cousins, F Vandin… - ACM Transactions on …, 2022 - dl.acm.org
“I'm an MC still as honest”–Eminem, Rap God We present MCRapper, an algorithm for
efficient computation of Monte-Carlo Empirical Rademacher Averages (MCERA) for families …

Mining top-K frequent itemsets through progressive sampling

A Pietracaprina, M Riondato, E Upfal… - Data Mining and …, 2010 - Springer
We study the use of sampling for efficiently mining the top-K frequent itemsets of cardinality
at most w. To this purpose, we define an approximation to the top-K frequent itemsets to be a …

Efficient privacy-preserving recommendations based on social graphs

A Wainakh, T Grube, J Daubert… - Proceedings of the 13th …, 2019 - dl.acm.org
Many recommender systems use association rules mining, a technique that captures
relations between user interests and recommends new probable ones accordingly. Applying …

Extracting topics with focused communities for social content recommendation

T Georgiou, A El Abbadi, X Yan - … of the 2017 ACM Conference on …, 2017 - dl.acm.org
A thorough understanding of social media discussions and the demographics of the users
involved in these discussions has become critical for many applications like business or …