A review of clustering techniques and developments

A Saxena, M Prasad, A Gupta, N Bharill, OP Patel… - Neurocomputing, 2017 - Elsevier
This paper presents a comprehensive study on clustering: exiting methods and
developments made at various times. Clustering is defined as an unsupervised learning …

Review of clustering technology and its application in coordinating vehicle subsystems

C Zhang, W Huang, T Niu, Z Liu, G Li, D Cao - Automotive Innovation, 2023 - Springer
Clustering is an unsupervised learning technology, and it groups information (observations
or datasets) according to similarity measures. Developing clustering algorithms is a hot topic …

An enhanced version of black hole algorithm via levy flight for optimization and data clustering problems

HA Abdulwahab, A Noraziah, AA Alsewari… - Ieee …, 2019 - ieeexplore.ieee.org
The processes of retrieving useful information from a dataset are an important data mining
technique that is commonly applied, known as Data Clustering. Recently, nature-inspired …

K-means and k-medoids: Cluster analysis on birth data collected in city Muzaffarabad, Kashmir

SA Abbas, A Aslam, AU Rehman, WA Abbasi… - IEEE …, 2020 - ieeexplore.ieee.org
In the field of medical, each and every analysis is decisive as the study links to life of the
subject under observation. One of the most vital area in the field of medical is the healthcare …

A survey of preprocessing methods used for analysis of big data originated from smart grids

TA Alghamdi, N Javaid - Ieee Access, 2022 - ieeexplore.ieee.org
In this paper, a brief survey of data preprocessing methods is presented. Specifically, the
data preprocessing methods used in the smart grid (SG) domain are surveyed. Also, with the …

{VBASE}: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity

Q Zhang, S Xu, Q Chen, G Sui, J Xie, Z Cai… - … USENIX Symposium on …, 2023 - usenix.org
Approximate similarity queries on high-dimensional vector indices have become the
cornerstone for many critical online services. An increasing need for more sophisticated …

Systematic review of clustering high-dimensional and large datasets

D Pandove, S Goel, R Rani - … on Knowledge Discovery from Data (TKDD …, 2018 - dl.acm.org
Technological advancement has enabled us to store and process huge amount of data in
relatively short spans of time. The nature of data is rapidly changing, particularly its …

Parallel and distributed architecture of genetic algorithm on Apache Hadoop and Spark

HC Lu, FJ Hwang, YH Huang - Applied Soft Computing, 2020 - Elsevier
The genetic algorithm (GA), one of the best-known metaheuristic algorithms, has been
extensively utilized in various fields of management science, operational research, and …

Proactive dynamic virtual-machine consolidation for energy conservation in cloud data centres

S Ismaeel, R Karim, A Miri - Journal of Cloud Computing, 2018 - Springer
Data center power consumption is among the largest commodity expenditures for many
organizations. Reduction of power used in cloud data centres with heterogeneous physical …

[PDF][PDF] Big data clustering: Algorithms and challenges

B Zerhari, AA Lahcen, S Mouline - Proc. of Int. Conf. on Big Data …, 2015 - researchgate.net
Big Data is usually defined by three characteristics called 3Vs (Volume, Velocity and
Variety). It refers to data that are too large, dynamic and complex. In this context, data are …