Improved Coresets for Euclidean -Means
V Cohen-Addad, K Green Larsen… - Advances in …, 2022 - proceedings.neurips.cc
Given a set of $ n $ points in $ d $ dimensions, the Euclidean $ k $-means problem (resp.
Euclidean $ k $-median) consists of finding $ k $ centers such that the sum of squared …
Euclidean $ k $-median) consists of finding $ k $ centers such that the sum of squared …
A new coreset framework for clustering
V Cohen-Addad, D Saulpic… - Proceedings of the 53rd …, 2021 - dl.acm.org
Given a metric space, the (k, z)-clustering problem consists of finding k centers such that the
sum of the of distances raised to the power z of every point to its closest center is minimized …
sum of the of distances raised to the power z of every point to its closest center is minimized …
Towards optimal lower bounds for k-median and k-means coresets
V Cohen-Addad, KG Larsen, D Saulpic… - Proceedings of the 54th …, 2022 - dl.acm.org
The (k, z)-clustering problem consists of finding a set of k points called centers, such that the
sum of distances raised to the power of z of every data point to its closest center is …
sum of distances raised to the power of z of every data point to its closest center is …
The power of uniform sampling for coresets
V Braverman, V Cohen-Addad… - 2022 IEEE 63rd …, 2022 - ieeexplore.ieee.org
Motivated by practical generalizations of the classic k-median and k-means objectives, such
as clustering with size constraints, fair clustering, and Wasserstein barycenter, we introduce …
as clustering with size constraints, fair clustering, and Wasserstein barycenter, we introduce …
Coresets for clustering in euclidean spaces: importance sampling is nearly optimal
L Huang, NK Vishnoi - Proceedings of the 52nd Annual ACM SIGACT …, 2020 - dl.acm.org
Given a collection of n points in ℝ d, the goal of the (k, z)-clustering problem is to find a
subset of k “centers” that minimizes the sum of the z-th powers of the Euclidean distance of …
subset of k “centers” that minimizes the sum of the z-th powers of the Euclidean distance of …
Improved coresets and sublinear algorithms for power means in euclidean spaces
V Cohen-Addad, D Saulpic… - Advances in Neural …, 2021 - proceedings.neurips.cc
In this paper, we consider the problem of finding high dimensional power means: given a set
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …
Oblivious dimension reduction for k-means: beyond subspaces and the Johnson-Lindenstrauss lemma
We show that for n points in d-dimensional Euclidean space, a data oblivious random
projection of the columns onto m∈ O ((log k+ loglog n) ε− 6log1/ε) dimensions is sufficient to …
projection of the columns onto m∈ O ((log k+ loglog n) ε− 6log1/ε) dimensions is sufficient to …
Near-Optimal -Clustering in the Sliding Window Model
Clustering is an important technique for identifying structural information in large-scale data
analysis, where the underlying dataset may be too large to store. In many applications …
analysis, where the underlying dataset may be too large to store. In many applications …
On the fixed-parameter tractability of capacitated clustering
V Cohen-Addad, J Li - arXiv preprint arXiv:2208.14129, 2022 - arxiv.org
We study the complexity of the classic capacitated k-median and k-means problems
parameterized by the number of centers, k. These problems are notoriously difficult since the …
parameterized by the number of centers, k. These problems are notoriously difficult since the …
On generalization bounds for projective clustering
MS Bucarelli, M Larsen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Given a set of points, clustering consists of finding a partition of a point set into $ k $ clusters
such that the center to which a point is assigned is as close as possible. Most commonly …
such that the center to which a point is assigned is as close as possible. Most commonly …