Clustering stability: an overview
U Von Luxburg - Foundations and Trends® in Machine …, 2010 - nowpublishers.com
A popular method for selecting the number of clusters is based on stability arguments: one
chooses the number of clusters such that the corresponding clustering results are" most …
chooses the number of clusters such that the corresponding clustering results are" most …
Automatic learning styles prediction: A survey of the State-of-the-Art (2006–2021)
Learning systems, whether traditional or computerized ones, often have a teacher-based
design and use a “one-size-fits-all” approach. This approach ignores learners' differences …
design and use a “one-size-fits-all” approach. This approach ignores learners' differences …
A new method for identifying potential hazardous areas of heavy metal pollution in sediments
Y Li, X Cheng, K Liu, Y Yu, Y Zhou - Water Research, 2022 - Elsevier
The combined effect of pollution source discharge and sediment adsorption leads to the
rapid enrichment of heavy metals and other pollutants in lake sediments, which poses a …
rapid enrichment of heavy metals and other pollutants in lake sediments, which poses a …
What can we learn privately?
Learning problems form an important category of computational tasks that generalizes many
of the computations researchers apply to large real-life data sets. We ask, What concept …
of the computations researchers apply to large real-life data sets. We ask, What concept …
How many topics? stability analysis for topic models
Topic modeling refers to the task of discovering the underlying thematic structure in a text
corpus, where the output is commonly presented as a report of the top terms appearing in …
corpus, where the output is commonly presented as a report of the top terms appearing in …
Multi-level bootstrap analysis of stable clusters in resting-state fMRI
A variety of methods have been developed to identify brain networks with spontaneous,
coherent activity in resting-state functional magnetic resonance imaging (fMRI). We propose …
coherent activity in resting-state functional magnetic resonance imaging (fMRI). We propose …
Required sample sizes for data-driven market segmentation analyses in tourism
Data analysts in industry and academia make heavy use of market segmentation analysis to
develop tourism knowledge and select commercially attractive target segments. Within …
develop tourism knowledge and select commercially attractive target segments. Within …
Making k-means even faster
G Hamerly - Proceedings of the 2010 SIAM international conference …, 2010 - SIAM
The k-means algorithm is widely used for clustering, compressing, and summarizing vector
data. In this paper, we propose a new acceleration for exact k-means that gives the same …
data. In this paper, we propose a new acceleration for exact k-means that gives the same …
Replicable clustering
We design replicable algorithms in the context of statistical clustering under the recently
introduced notion of replicability from Impagliazzo et al.[2022]. According to this definition, a …
introduced notion of replicability from Impagliazzo et al.[2022]. According to this definition, a …
Center-based clustering under perturbation stability
Clustering under most popular objective functions is NP-hard, even to approximate well, and
so unlikely to be efficiently solvable in the worst case. Recently, Bilu and Linial (2010)[11] …
so unlikely to be efficiently solvable in the worst case. Recently, Bilu and Linial (2010)[11] …