WB-index: A sum-of-squares based index for cluster validity

Q Zhao, P Fränti - Data & Knowledge Engineering, 2014 - Elsevier
Determining the number of clusters is an important part of cluster validity that has been
widely studied in cluster analysis. Sum-of-squares based indices show promising properties …

[HTML][HTML] K-sets and k-swaps algorithms for clustering sets

M Rezaei, P Fränti - Pattern Recognition, 2023 - Elsevier
We present two new clustering algorithms called k-sets and k-swaps for data where each
object is a set. First, we define the mean of the sets in a cluster, and the distance between a …

Disease prediction in data mining using association rule mining and keyword based clustering algorithms

S Ramasamy, K Nirmala - International Journal of Computers and …, 2020 - Taylor & Francis
The health sector today contains hidden information that can be important in making
decisions. It is difficult for medical practitioners to predict the disease as it is a complex task …

Matching similarity for keyword-based clustering

M Rezaei, P Fränti - … , Syntactic, and Statistical Pattern Recognition: Joint …, 2014 - Springer
Semantic clustering of objects such as documents, web sites and movies based on their
keywords is a challenging problem. This requires a similarity measure between two sets of …

Cuckoo-filter based privacy-aware search over encrypted cloud data

Q Xue, MC Chuah - … 11th international conference on mobile ad …, 2015 - ieeexplore.ieee.org
Many organizations and individual users are out-sourcing their information which includes
sensitive data into the cloud. To deal with the potential risks of privacy exposure, such data …

A Technique to Pre-trained Neural Network Language Model Customization to Software Development Domain

PV Dudarin, VG Tronin, KV Svyatov - Artificial Intelligence: 17th Russian …, 2019 - Springer
According to the CHAOS report from Standish Group during 1992–2017, the degree of
success of projects in the development of software intensive systems (Software Intensive …

Clustering of users on microblogging social media: A rough set based approach

M Gupta, P Kumar, B Bhasker - 2016 International Conference …, 2016 - ieeexplore.ieee.org
Microblogging social media platforms like Twitter, Tumblr and Plurk have radically changed
our lives. The presence of millions of people has made these platforms a preferred channel …

Connecting graphs to machine learning

G Candel - 2022 - theses.hal.science
This thesis proposes new approaches to process graph using machine learning algorithms
designed for tabular data. A graph is a data structure made of nodes linked to each others by …

[HTML][HTML] ОБЗОР ПОДХОДОВ КЛАСТЕРИЗАЦИИ ПОИСКОВЫХ КЛЮЧЕВЫХ ФРАЗ ПО СЕМАНТИЧЕСКОЙ СХОЖЕСТИ МЕТОДАМИ МАШИННОГО ОБУЧЕНИЯ

ЕМ Бушуев - Вестник науки, 2023 - cyberleninka.ru
В статье анализируются основные методы и подходы кластерного анализа
семантического ядра с применением методов машинного обучения. Рассматриваются …

[PDF][PDF] An approach to customization of pre-trained neural network language model to specific domain

D PV, T VG, S KV - 2019 - dialog-21.ru
Nowadays the majority of tasks in NLP field are solved by means of neural network
language models. These models already have shown state-ofthe-art results in classification …