Solving -center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially
Center-based clustering is a fundamental primitive for data analysis and becomes very
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …
Diversity maximization in the presence of outliers
D Amagata - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Given a set X of n points in a metric space, the problem of diversity maximization is to extract
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …
Diverse data selection under fairness constraints
Z Moumoulidou, A McGregor, A Meliou - arXiv preprint arXiv:2010.09141, 2020 - arxiv.org
Diversity is an important principle in data selection and summarization, facility location, and
recommendation systems. Our work focuses on maximizing diversity in data selection, while …
recommendation systems. Our work focuses on maximizing diversity in data selection, while …
Local search for max-sum diversification
We provide simple and fast polynomial-time approximation schemes (PTASs) for several
variants of the max-sum diversification problem which, in its most basic form, is as follows …
variants of the max-sum diversification problem which, in its most basic form, is as follows …
Fair max–min diversity maximization in streaming and sliding-window models
Diversity maximization is a fundamental problem with broad applications in data
summarization, web search, and recommender systems. Given a set X of n elements, the …
summarization, web search, and recommender systems. Given a set X of n elements, the …
Improved sliding window algorithms for clustering and coverage via bucketing-based sketches
Streaming computation plays an important role in large-scale data analysis. The sliding
window model is a model of streaming computation which also captures the recency of the …
window model is a model of streaming computation which also captures the recency of the …
Composable core-sets for determinant maximization problems via spectral spanners
We study a generalization of classical combinatorial graph spanners to the spectral setting.
Given a set of vectors V⊆ ℝ d, we say a set U⊆ V is an α-spectral k spanner, for k≤ d, if for …
Given a set of vectors V⊆ ℝ d, we say a set U⊆ V is an α-spectral k spanner, for k≤ d, if for …
Improved approximation and scalability for fair max-min diversification
Given an $ n $-point metric space $(\mathcal {X}, d) $ where each point belongs to one of $
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …
Composable core-sets for determinant maximization: A simple near-optimal algorithm
Abstract “Composable core-sets” are an efficient framework for solving optimization
problems in massive data models. In this work, we consider efficient construction of …
problems in massive data models. In this work, we consider efficient construction of …
Streaming algorithms for diversity maximization with fairness constraints
Diversity maximization is a fundamental problem with wide applications in data
summarization, web search, and recommender systems. Given a set X of n elements, it asks …
summarization, web search, and recommender systems. Given a set X of n elements, it asks …