Improved approximation algorithms for individually fair clustering

A Vakilian, M Yalciner - International conference on artificial …, 2022 - proceedings.mlr.press
We consider the $ k $-clustering problem with $\ell_p $-norm cost, which includes $ k $-
median, $ k $-means and $ k $-center, under an individual notion of fairness proposed by …

Approximation algorithms for fair range clustering

SS Hotegni, S Mahabadi… - … Conference on Machine …, 2023 - proceedings.mlr.press
This paper studies the fair range clustering problem in which the data points are from
different demographic groups and the goal is to pick $ k $ centers with the minimum …

Efficient dynamic weighted set sampling and its extension

F Zhang, M Jiang, S Wang - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
Given a weighted set S of n elements, weighted set sampling (WSS) samples an element in
S so that each element ai; is sampled with a probability proportional to its weight w (ai). The …

Algorithmic techniques for independent query sampling

Y Tao - Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI …, 2022 - dl.acm.org
Unlike a reporting query that returns all the elements satisfying a predicate, query sampling
returns only a sample set of those elements and has long been recognized as an important …

Fair near neighbor search via sampling

M Aumuller, S Har-Peled, S Mahabadi, R Pagh… - ACM SIGMOD …, 2021 - dl.acm.org
Similarity search is a fundamental algorithmic primitive, widely used in many computer
science disciplines. Given a set of points S and a radius parameter r> 0, the rnear neighbor …

Independent range sampling on interval data

D Amagata - 2024 IEEE 40th International Conference on Data …, 2024 - ieeexplore.ieee.org
Many applications require efficient management of large sets of intervals because many
objects are associated with intervals (eg, time and price intervals). In such interval …

Simpler is Much Faster: Fair and Independent Inner Product Search

K Aoyama, D Amagata, S Fujita, T Hara - Proceedings of the 46th …, 2023 - dl.acm.org
The problem of inner product search (IPS) is important in many fields. Although maximum
inner product search (MIPS) is often considered, its result is usually skewed and static …

Bayesian label distribution propagation: A semi-supervised probabilistic k nearest neighbor classifier

JMN Gøttcke, A Zimek, RJGB Campello - Information Systems, 2024 - Elsevier
Semi-supervised classification methods are specialized to use a very limited amount of
labeled data for training and ultimately for assigning labels to the vast majority of unlabeled …

Sampling near neighbors in search for fairness

M Aumüller, S Har-Peled, S Mahabadi, R Pagh… - Communications of the …, 2022 - dl.acm.org
Similarity search is a fundamental algorithmic primitive, widely used in many computer
science disciplines. Given a set of points S and a radius parameter r> 0, the r-near neighbor …

FairHash: A Fair and Memory/Time-efficient Hashmap

N Shahbazi, S Sintos, A Asudeh - … of the ACM on Management of Data, 2024 - dl.acm.org
Hashmap is a fundamental data structure in computer science. There has been extensive
research on constructing hashmaps that minimize the number of collisions leading to …