Towards optimal lower bounds for k-median and k-means coresets
V Cohen-Addad, KG Larsen, D Saulpic… - Proceedings of the 54th …, 2022 - dl.acm.org
The (k, z)-clustering problem consists of finding a set of k points called centers, such that the
sum of distances raised to the power of z of every data point to its closest center is …
sum of distances raised to the power of z of every data point to its closest center is …
Improved approximations for Euclidean k-means and k-median, via nested quasi-independent sets
Motivated by data analysis and machine learning applications, we consider the popular high-
dimensional Euclidean k-median and k-means problems. We propose a new primal-dual …
dimensional Euclidean k-median and k-means problems. We propose a new primal-dual …
Multi-swap k-means++
L Beretta, V Cohen-Addad… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract The $ k $-means++ algorithm of Arthur and Vassilvitskii (SODA 2007) is often the
practitioners' choice algorithm for optimizing the popular $ k $-means clustering objective …
practitioners' choice algorithm for optimizing the popular $ k $-means clustering objective …
[PDF][PDF] Understanding the Cluster Linear Program for Correlation Clustering
In the classic Correlation Clustering problem introduced by Bansal, Blum, and Chawla
(FOCS 2002), the input is a complete graph where edges are labeled either+ or−, and the …
(FOCS 2002), the input is a complete graph where edges are labeled either+ or−, and the …
FPT constant-approximations for capacitated clustering to minimize the sum of cluster radii
Clustering with capacity constraints is a fundamental problem that attracted significant
attention throughout the years. In this paper, we give the first FPT constant-factor …
attention throughout the years. In this paper, we give the first FPT constant-factor …
Near-Optimal Private and Scalable -Clustering
We study the differentially private (DP) $ k $-means and $ k $-median clustering problems of
$ n $ points in $ d $-dimensional Euclidean space in the massively parallel computation …
$ n $ points in $ d $-dimensional Euclidean space in the massively parallel computation …
Metric clustering and MST with strong and weak distance oracles
MH Bateni, P Dharangutte… - The Thirty Seventh …, 2024 - proceedings.mlr.press
We study optimization problems in a metric space $(\mathcal {X}, d) $ where we can
compute distances in two ways: via a “strong” oracle that returns exact distances $ d (x, y) …
compute distances in two ways: via a “strong” oracle that returns exact distances $ d (x, y) …
The price of explainability for clustering
Given a set of points in d-dimensional space, an explainable clustering is one where the
clusters are specified by a tree of axis-aligned threshold cuts. Dasgupta et al.(ICML 2020) …
clusters are specified by a tree of axis-aligned threshold cuts. Dasgupta et al.(ICML 2020) …
Random cuts are optimal for explainable k-medians
K Makarychev, L Shan - Advances in Neural Information …, 2024 - proceedings.neurips.cc
We show that the RandomCoordinateCut algorithm gives the optimal competitive ratio for
explainable $ k $-medians in $\ell_1 $. The problem of explainable $ k $-medians was …
explainable $ k $-medians in $\ell_1 $. The problem of explainable $ k $-medians was …
Approximation algorithms for continuous clustering and facility location problems
D Chakrabarty, M Negahbani, A Sarkar - arXiv preprint arXiv:2206.15105, 2022 - arxiv.org
We consider the approximability of center-based clustering problems where the points to be
clustered lie in a metric space, and no candidate centers are specified. We call such …
clustered lie in a metric space, and no candidate centers are specified. We call such …