K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …
[HTML][HTML] How much can k-means be improved by using better initialization and repeats?
P Fränti, S Sieranoja - Pattern Recognition, 2019 - Elsevier
In this paper, we study what are the most important factors that deteriorate the performance
of the k-means algorithm, and how much this deterioration can be overcome either by using …
of the k-means algorithm, and how much this deterioration can be overcome either by using …
[PDF][PDF] Clustering techniques: a brief survey of different clustering algorithms
Partitioning a set of objects into homogeneous clusters is a fundamental operation in data
mining. The operation is needed in a number of data mining tasks. Clustering or data …
mining. The operation is needed in a number of data mining tasks. Clustering or data …
A comparative study of efficient initialization methods for the k-means clustering algorithm
K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately,
due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of …
due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of …
A method for initialising the K-means clustering algorithm using kd-trees
SJ Redmond, C Heneghan - Pattern recognition letters, 2007 - Elsevier
We present a method for initialising the K-means clustering algorithm. Our method hinges on
the use of a kd-tree to perform a density estimation of the data at various locations. We then …
the use of a kd-tree to perform a density estimation of the data at various locations. We then …
Who does what: Collaboration patterns in the Wikipedia and their impact on article quality
The quality of Wikipedia articles is debatable. On the one hand, existing research indicates
that not only are people willing to contribute articles but the quality of these articles is close …
that not only are people willing to contribute articles but the quality of these articles is close …
Development of new seed with modified validity measures for k-means clustering
S Manochandar, M Punniyamoorthy… - Computers & Industrial …, 2020 - Elsevier
Conventional k-means clustering is the widely used partitional method, mainly adapted to
machine learning and pattern recognition problems. This algorithm is highly sensitive to …
machine learning and pattern recognition problems. This algorithm is highly sensitive to …
In search of deterministic methods for initializing K-means and Gaussian mixture clustering
T Su, JG Dy - Intelligent Data Analysis, 2007 - content.iospress.com
The performance of K-means and Gaussian mixture model (GMM) clustering depends on
the initial guess of partitions. Typically, clustering algorithms are initialized by random starts …
the initial guess of partitions. Typically, clustering algorithms are initialized by random starts …
Vector Field k‐Means: Clustering Trajectories by Fitting Multiple Vector Fields
N Ferreira, JT Klosowski… - Computer Graphics …, 2013 - Wiley Online Library
Scientists study trajectory data to understand trends in movement patterns, such as human
mobility for traffic analysis and urban planning. In this paper, we introduce a novel trajectory …
mobility for traffic analysis and urban planning. In this paper, we introduce a novel trajectory …
A two-stage genetic algorithm for automatic clustering
In this paper, a two-stage genetic clustering algorithm (TGCA) is proposed. This algorithm
can automatically determine the proper number of clusters and the proper partition from a …
can automatically determine the proper number of clusters and the proper partition from a …