Clustering stability: an overview

U Von Luxburg - Foundations and Trends® in Machine …, 2010 - nowpublishers.com
A popular method for selecting the number of clusters is based on stability arguments: one
chooses the number of clusters such that the corresponding clustering results are" most …

Automatic learning styles prediction: A survey of the State-of-the-Art (2006–2021)

M Raleiras, AH Nabizadeh, FA Costa - Journal of Computers in Education, 2022 - Springer
Learning systems, whether traditional or computerized ones, often have a teacher-based
design and use a “one-size-fits-all” approach. This approach ignores learners' differences …

A new method for identifying potential hazardous areas of heavy metal pollution in sediments

Y Li, X Cheng, K Liu, Y Yu, Y Zhou - Water Research, 2022 - Elsevier
The combined effect of pollution source discharge and sediment adsorption leads to the
rapid enrichment of heavy metals and other pollutants in lake sediments, which poses a …

What can we learn privately?

SP Kasiviswanathan, HK Lee, K Nissim… - SIAM Journal on …, 2011 - SIAM
Learning problems form an important category of computational tasks that generalizes many
of the computations researchers apply to large real-life data sets. We ask, What concept …

How many topics? stability analysis for topic models

D Greene, D O'Callaghan, P Cunningham - Machine Learning and …, 2014 - Springer
Topic modeling refers to the task of discovering the underlying thematic structure in a text
corpus, where the output is commonly presented as a report of the top terms appearing in …

Multi-level bootstrap analysis of stable clusters in resting-state fMRI

P Bellec, P Rosa-Neto, OC Lyttelton, H Benali… - Neuroimage, 2010 - Elsevier
A variety of methods have been developed to identify brain networks with spontaneous,
coherent activity in resting-state functional magnetic resonance imaging (fMRI). We propose …

Required sample sizes for data-driven market segmentation analyses in tourism

S Dolnicar, B Grün, F Leisch… - Journal of Travel …, 2014 - journals.sagepub.com
Data analysts in industry and academia make heavy use of market segmentation analysis to
develop tourism knowledge and select commercially attractive target segments. Within …

Making k-means even faster

G Hamerly - Proceedings of the 2010 SIAM international conference …, 2010 - SIAM
The k-means algorithm is widely used for clustering, compressing, and summarizing vector
data. In this paper, we propose a new acceleration for exact k-means that gives the same …

Replicable clustering

H Esfandiari, A Karbasi, V Mirrokni… - Advances in …, 2024 - proceedings.neurips.cc
We design replicable algorithms in the context of statistical clustering under the recently
introduced notion of replicability from Impagliazzo et al.[2022]. According to this definition, a …

Center-based clustering under perturbation stability

P Awasthi, A Blum, O Sheffet - Information Processing Letters, 2012 - Elsevier
Clustering under most popular objective functions is NP-hard, even to approximate well, and
so unlikely to be efficiently solvable in the worst case. Recently, Bilu and Linial (2010)[11] …