Initialization of cluster refinement algorithms: A review and comparative study

K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier

Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

被引用次数：762 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] How much can k-means be improved by using better initialization and repeats?

P Fränti, S Sieranoja - Pattern Recognition, 2019 - Elsevier

In this paper, we study what are the most important factors that deteriorate the performance
of the k-means algorithm, and how much this deterioration can be overcome either by using …

被引用次数：462 相关文章所有 9 个版本

[PDF] academia.edu

[PDF][PDF] Clustering techniques: a brief survey of different clustering algorithms

D Sisodia, L Singh, S Sisodia… - International Journal of …, 2012 - academia.edu

Partitioning a set of objects into homogeneous clusters is a fundamental operation in data
mining. The operation is needed in a number of data mining tasks. Clustering or data …

被引用次数：141 相关文章所有 4 个版本

[PDF] arxiv.org

A comparative study of efficient initialization methods for the k-means clustering algorithm

ME Celebi, HA Kingravi, PA Vela - Expert systems with applications, 2013 - Elsevier

K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately,
due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of …

被引用次数：1501 相关文章所有 10 个版本

[PDF] psu.edu

A method for initialising the K-means clustering algorithm using kd-trees

SJ Redmond, C Heneghan - Pattern recognition letters, 2007 - Elsevier

We present a method for initialising the K-means clustering algorithm. Our method hinges on
the use of a kd-tree to perform a density estimation of the data at various locations. We then …

被引用次数：436 相关文章所有 8 个版本

[PDF] dsu.edu

Who does what: Collaboration patterns in the Wikipedia and their impact on article quality

J Liu, S Ram - ACM Transactions on Management Information …, 2011 - dl.acm.org

The quality of Wikipedia articles is debatable. On the one hand, existing research indicates
that not only are people willing to contribute articles but the quality of these articles is close …

被引用次数：240 相关文章所有 3 个版本

Development of new seed with modified validity measures for k-means clustering

S Manochandar, M Punniyamoorthy… - Computers & Industrial …, 2020 - Elsevier

Conventional k-means clustering is the widely used partitional method, mainly adapted to
machine learning and pattern recognition problems. This algorithm is highly sensitive to …

被引用次数：65 相关文章所有 2 个版本

[PDF] northeastern.edu

In search of deterministic methods for initializing K-means and Gaussian mixture clustering

T Su, JG Dy - Intelligent Data Analysis, 2007 - content.iospress.com

The performance of K-means and Gaussian mixture model (GMM) clustering depends on
the initial guess of partitions. Typically, clustering algorithms are initialized by random starts …

被引用次数：195 相关文章所有 13 个版本

[PDF] arxiv.org

Vector Field k‐Means: Clustering Trajectories by Fitting Multiple Vector Fields

N Ferreira, JT Klosowski… - Computer Graphics …, 2013 - Wiley Online Library

Scientists study trajectory data to understand trends in movement patterns, such as human
mobility for traffic analysis and urban planning. In this paper, we introduce a novel trajectory …

被引用次数：144 相关文章所有 12 个版本

[PDF] psu.edu

A two-stage genetic algorithm for automatic clustering

H He, Y Tan - Neurocomputing, 2012 - Elsevier

In this paper, a two-stage genetic clustering algorithm (TGCA) is proposed. This algorithm
can automatically determine the proper number of clusters and the proper partition from a …

被引用次数：152 相关文章所有 8 个版本