A new coreset framework for clustering

V Cohen-Addad, D Saulpic… - Proceedings of the 53rd …, 2021 - dl.acm.org
Given a metric space, the (k, z)-clustering problem consists of finding k centers such that the
sum of the of distances raised to the power z of every point to its closest center is minimized …

Improved coresets and sublinear algorithms for power means in euclidean spaces

V Cohen-Addad, D Saulpic… - Advances in Neural …, 2021 - proceedings.neurips.cc
In this paper, we consider the problem of finding high dimensional power means: given a set
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …

Solving -center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially

M Ceccarello, A Pietracaprina, G Pucci - arXiv preprint arXiv:1802.09205, 2018 - arxiv.org
Center-based clustering is a fundamental primitive for data analysis and becomes very
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …

Diversity maximization in the presence of outliers

D Amagata - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Given a set X of n points in a metric space, the problem of diversity maximization is to extract
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …

Diverse data selection under fairness constraints

Z Moumoulidou, A McGregor, A Meliou - arXiv preprint arXiv:2010.09141, 2020 - arxiv.org
Diversity is an important principle in data selection and summarization, facility location, and
recommendation systems. Our work focuses on maximizing diversity in data selection, while …

Core-sets for fair and diverse data summarization

S Mahabadi, S Trajanovski - Advances in Neural …, 2024 - proceedings.neurips.cc
We study core-set construction algorithms for the task of Diversity Maximization under
fairness/partition constraint. Given a set of points $ P $ in a metric space partitioned into $ m …

[HTML][HTML] Fair max–min diversity maximization in streaming and sliding-window models

Y Wang, F Fabbri, M Mathioudakis, J Li - Entropy, 2023 - mdpi.com
Diversity maximization is a fundamental problem with broad applications in data
summarization, web search, and recommender systems. Given a set X of n elements, the …

[HTML][HTML] MapReduce algorithms for robust center-based clustering in doubling metrics

E Dandolo, A Mazzetto, A Pietracaprina… - Journal of Parallel and …, 2024 - Elsevier
Clustering is a pivotal primitive for unsupervised learning and data analysis. A popular
variant is the (k, ℓ)-clustering problem, where, given a pointset P from a metric space, one …

Improved approximation and scalability for fair max-min diversification

R Addanki, A McGregor, A Meliou… - arXiv preprint arXiv …, 2022 - arxiv.org
Given an $ n $-point metric space $(\mathcal {X}, d) $ where each point belongs to one of $
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …

Streaming algorithms for diversity maximization with fairness constraints

Y Wang, F Fabbri… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
Diversity maximization is a fundamental problem with wide applications in data
summarization, web search, and recommender systems. Given a set X of n elements, it asks …