Solving -center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially

M Ceccarello, A Pietracaprina, G Pucci - arXiv preprint arXiv:1802.09205, 2018 - arxiv.org
Center-based clustering is a fundamental primitive for data analysis and becomes very
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …

Diversity maximization in the presence of outliers

D Amagata - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Given a set X of n points in a metric space, the problem of diversity maximization is to extract
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …

Diverse data selection under fairness constraints

Z Moumoulidou, A McGregor, A Meliou - arXiv preprint arXiv:2010.09141, 2020 - arxiv.org
Diversity is an important principle in data selection and summarization, facility location, and
recommendation systems. Our work focuses on maximizing diversity in data selection, while …

Local search for max-sum diversification

A Cevallos, F Eisenbrand, R Zenklusen - … of the Twenty-Eighth Annual ACM …, 2017 - SIAM
We provide simple and fast polynomial-time approximation schemes (PTASs) for several
variants of the max-sum diversification problem which, in its most basic form, is as follows …

Fair max–min diversity maximization in streaming and sliding-window models

Y Wang, F Fabbri, M Mathioudakis, J Li - Entropy, 2023 - mdpi.com
Diversity maximization is a fundamental problem with broad applications in data
summarization, web search, and recommender systems. Given a set X of n elements, the …

Improved sliding window algorithms for clustering and coverage via bucketing-based sketches

A Epasto, M Mahdian, V Mirrokni, P Zhong - … of the 2022 Annual ACM-SIAM …, 2022 - SIAM
Streaming computation plays an important role in large-scale data analysis. The sliding
window model is a model of streaming computation which also captures the recency of the …

Composable core-sets for determinant maximization problems via spectral spanners

P Indyk, S Mahabadi, SO Gharan, A Rezaei - Proceedings of the Fourteenth …, 2020 - SIAM
We study a generalization of classical combinatorial graph spanners to the spectral setting.
Given a set of vectors V⊆ ℝ d, we say a set U⊆ V is an α-spectral k spanner, for k≤ d, if for …

Improved approximation and scalability for fair max-min diversification

R Addanki, A McGregor, A Meliou… - arXiv preprint arXiv …, 2022 - arxiv.org
Given an $ n $-point metric space $(\mathcal {X}, d) $ where each point belongs to one of $
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …

Composable core-sets for determinant maximization: A simple near-optimal algorithm

S Mahabadi, P Indyk, SO Gharan… - … on Machine Learning, 2019 - proceedings.mlr.press
Abstract “Composable core-sets” are an efficient framework for solving optimization
problems in massive data models. In this work, we consider efficient construction of …

Streaming algorithms for diversity maximization with fairness constraints

Y Wang, F Fabbri… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
Diversity maximization is a fundamental problem with wide applications in data
summarization, web search, and recommender systems. Given a set X of n elements, it asks …