A new coreset framework for clustering
V Cohen-Addad, D Saulpic… - Proceedings of the 53rd …, 2021 - dl.acm.org
Given a metric space, the (k, z)-clustering problem consists of finding k centers such that the
sum of the of distances raised to the power z of every point to its closest center is minimized …
sum of the of distances raised to the power z of every point to its closest center is minimized …
Improved coresets and sublinear algorithms for power means in euclidean spaces
V Cohen-Addad, D Saulpic… - Advances in Neural …, 2021 - proceedings.neurips.cc
In this paper, we consider the problem of finding high dimensional power means: given a set
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …
Solving -center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially
Center-based clustering is a fundamental primitive for data analysis and becomes very
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …
Diversity maximization in the presence of outliers
D Amagata - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Given a set X of n points in a metric space, the problem of diversity maximization is to extract
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …
Diverse data selection under fairness constraints
Z Moumoulidou, A McGregor, A Meliou - arXiv preprint arXiv:2010.09141, 2020 - arxiv.org
Diversity is an important principle in data selection and summarization, facility location, and
recommendation systems. Our work focuses on maximizing diversity in data selection, while …
recommendation systems. Our work focuses on maximizing diversity in data selection, while …
Core-sets for fair and diverse data summarization
S Mahabadi, S Trajanovski - Advances in Neural …, 2024 - proceedings.neurips.cc
We study core-set construction algorithms for the task of Diversity Maximization under
fairness/partition constraint. Given a set of points $ P $ in a metric space partitioned into $ m …
fairness/partition constraint. Given a set of points $ P $ in a metric space partitioned into $ m …
[HTML][HTML] Fair max–min diversity maximization in streaming and sliding-window models
Diversity maximization is a fundamental problem with broad applications in data
summarization, web search, and recommender systems. Given a set X of n elements, the …
summarization, web search, and recommender systems. Given a set X of n elements, the …
[HTML][HTML] MapReduce algorithms for robust center-based clustering in doubling metrics
E Dandolo, A Mazzetto, A Pietracaprina… - Journal of Parallel and …, 2024 - Elsevier
Clustering is a pivotal primitive for unsupervised learning and data analysis. A popular
variant is the (k, ℓ)-clustering problem, where, given a pointset P from a metric space, one …
variant is the (k, ℓ)-clustering problem, where, given a pointset P from a metric space, one …
Improved approximation and scalability for fair max-min diversification
Given an $ n $-point metric space $(\mathcal {X}, d) $ where each point belongs to one of $
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …
Streaming algorithms for diversity maximization with fairness constraints
Diversity maximization is a fundamental problem with wide applications in data
summarization, web search, and recommender systems. Given a set X of n elements, it asks …
summarization, web search, and recommender systems. Given a set X of n elements, it asks …