K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

An ensemble modeling approach to forecast daily reservoir inflow using bidirectional long-and short-term memory (Bi-LSTM), variational mode decomposition (VMD) …

F Li, G Ma, S Chen, W Huang - Water Resources Management, 2021 - Springer
Daily inflow forecasts provide important decision support for the operations and
management of reservoirs. Accurate and reliable forecasting plays an important role in the …

[HTML][HTML] Adaptive explicit kernel minkowski weighted k-means

A Aradnia, MA Haeri, MM Ebadzadeh - Information sciences, 2022 - Elsevier
The K-means algorithm is among the most commonly used data clustering methods.
However, the regular K-means can only be applied in the input space, and it is applicable …

Objective-based hierarchical clustering of deep embedding vectors

S Naumov, G Yaroslavtsev, D Avdiukhin - Proceedings of the AAAI …, 2021 - ojs.aaai.org
We initiate a comprehensive experimental study of objective-based hierarchical clustering
methods on massive datasets consisting of deep embedding vectors from computer vision …

Approximate high dimensional graph mining with matrix polar factorization: A Twitter application

G Drakopoulos, E Kafeza, P Mylonas… - … Conference on Big …, 2021 - ieeexplore.ieee.org
At the dawn of the Internet era graph analytics play an important role in high-and low-level
network policymaking across a wide array of fields so diverse as transportation network …

A multi-model fusion soft measurement method for cement clinker f-CaO content based on K-means++ and EMD-MKRVM

R Zhang, S Lu, X Wang, H Yu… - Transactions of the …, 2023 - journals.sagepub.com
The content of free calcium oxide (f-CaO) in cement clinker is a key indicator for testing the
quality of cement clinker. To address the problem that the content of f-CaO cannot be …

On The Relative Error of Random Fourier Features for Preserving Kernel Distance

K Cheng, SHC Jiang, L Wei, Z Wei - The Eleventh International …, 2022 - openreview.net
The method of random Fourier features (RFF), proposed in a seminal paper by Rahimi and
Recht (NIPS'07), is a powerful technique to find approximate low-dimensional …

Efficient High-Dimensional Kernel k-Means++ with Random Projection

JYK Chan, AP Leung, Y Xie - Applied Sciences, 2021 - mdpi.com
Using random projection, a method to speed up both kernel k-means and centroid
initialization with k-means++ is proposed. We approximate the kernel matrix and distances …

New fuzzy subtractive clustering approach: an application of order allocation in e-supply chain system

NVQ Nhu, NV Hop - International Journal of Logistics …, 2024 - inderscienceonline.com
This study proposes a so-called new fuzzy subtractive clustering (NFSC) algorithm to
allocate orders to the appropriate hub with three criteria, namely, travelling distance, delivery …

Study Of Existing Methods & Techniques Of K-Means Clustering

S Yadav, S Sharma - Educational Administration: Theory And Practice, 2024 - kuey.net
In the field of data mining, clustering is the technique of grouping millions of data points to
form clusters. Data of the same class are grouped together. K-Means clustering is the most …