Towards optimal lower bounds for k-median and k-means coresets

V Cohen-Addad, KG Larsen, D Saulpic… - Proceedings of the 54th …, 2022 - dl.acm.org
The (k, z)-clustering problem consists of finding a set of k points called centers, such that the
sum of distances raised to the power of z of every data point to its closest center is …

Improved approximations for Euclidean k-means and k-median, via nested quasi-independent sets

V Cohen-Addad, H Esfandiari, V Mirrokni… - Proceedings of the 54th …, 2022 - dl.acm.org
Motivated by data analysis and machine learning applications, we consider the popular high-
dimensional Euclidean k-median and k-means problems. We propose a new primal-dual …

Multi-swap k-means++

L Beretta, V Cohen-Addad… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract The $ k $-means++ algorithm of Arthur and Vassilvitskii (SODA 2007) is often the
practitioners' choice algorithm for optimizing the popular $ k $-means clustering objective …

[PDF][PDF] Understanding the Cluster Linear Program for Correlation Clustering

N Cao, V Cohen-Addad, E Lee, S Li… - Proceedings of the 56th …, 2024 - dl.acm.org
In the classic Correlation Clustering problem introduced by Bansal, Blum, and Chawla ‍
(FOCS 2002), the input is a complete graph where edges are labeled either+ or−, and the …

FPT constant-approximations for capacitated clustering to minimize the sum of cluster radii

S Bandyapadhyay, W Lochet, S Saurabh - arXiv preprint arXiv:2303.07923, 2023 - arxiv.org
Clustering with capacity constraints is a fundamental problem that attracted significant
attention throughout the years. In this paper, we give the first FPT constant-factor …

Near-Optimal Private and Scalable -Clustering

V Cohen-Addad, A Epasto, V Mirrokni… - Advances in …, 2022 - proceedings.neurips.cc
We study the differentially private (DP) $ k $-means and $ k $-median clustering problems of
$ n $ points in $ d $-dimensional Euclidean space in the massively parallel computation …

Metric clustering and MST with strong and weak distance oracles

MH Bateni, P Dharangutte… - The Thirty Seventh …, 2024 - proceedings.mlr.press
We study optimization problems in a metric space $(\mathcal {X}, d) $ where we can
compute distances in two ways: via a “strong” oracle that returns exact distances $ d (x, y) …

The price of explainability for clustering

A Gupta, MR Pittu, O Svensson… - 2023 IEEE 64th Annual …, 2023 - ieeexplore.ieee.org
Given a set of points in d-dimensional space, an explainable clustering is one where the
clusters are specified by a tree of axis-aligned threshold cuts. Dasgupta et al.(ICML 2020) …

Random cuts are optimal for explainable k-medians

K Makarychev, L Shan - Advances in Neural Information …, 2024 - proceedings.neurips.cc
We show that the RandomCoordinateCut algorithm gives the optimal competitive ratio for
explainable $ k $-medians in $\ell_1 $. The problem of explainable $ k $-medians was …

Approximation algorithms for continuous clustering and facility location problems

D Chakrabarty, M Negahbani, A Sarkar - arXiv preprint arXiv:2206.15105, 2022 - arxiv.org
We consider the approximability of center-based clustering problems where the points to be
clustered lie in a metric space, and no candidate centers are specified. We call such …