Improved approximation algorithms for individually fair clustering
A Vakilian, M Yalciner - International conference on artificial …, 2022 - proceedings.mlr.press
We consider the $ k $-clustering problem with $\ell_p $-norm cost, which includes $ k $-
median, $ k $-means and $ k $-center, under an individual notion of fairness proposed by …
median, $ k $-means and $ k $-center, under an individual notion of fairness proposed by …
Approximation algorithms for fair range clustering
SS Hotegni, S Mahabadi… - … Conference on Machine …, 2023 - proceedings.mlr.press
This paper studies the fair range clustering problem in which the data points are from
different demographic groups and the goal is to pick $ k $ centers with the minimum …
different demographic groups and the goal is to pick $ k $ centers with the minimum …
Efficient dynamic weighted set sampling and its extension
Given a weighted set S of n elements, weighted set sampling (WSS) samples an element in
S so that each element ai; is sampled with a probability proportional to its weight w (ai). The …
S so that each element ai; is sampled with a probability proportional to its weight w (ai). The …
Algorithmic techniques for independent query sampling
Y Tao - Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI …, 2022 - dl.acm.org
Unlike a reporting query that returns all the elements satisfying a predicate, query sampling
returns only a sample set of those elements and has long been recognized as an important …
returns only a sample set of those elements and has long been recognized as an important …
Fair near neighbor search via sampling
Similarity search is a fundamental algorithmic primitive, widely used in many computer
science disciplines. Given a set of points S and a radius parameter r> 0, the rnear neighbor …
science disciplines. Given a set of points S and a radius parameter r> 0, the rnear neighbor …
Independent range sampling on interval data
D Amagata - 2024 IEEE 40th International Conference on Data …, 2024 - ieeexplore.ieee.org
Many applications require efficient management of large sets of intervals because many
objects are associated with intervals (eg, time and price intervals). In such interval …
objects are associated with intervals (eg, time and price intervals). In such interval …
Simpler is Much Faster: Fair and Independent Inner Product Search
The problem of inner product search (IPS) is important in many fields. Although maximum
inner product search (MIPS) is often considered, its result is usually skewed and static …
inner product search (MIPS) is often considered, its result is usually skewed and static …
Bayesian label distribution propagation: A semi-supervised probabilistic k nearest neighbor classifier
Semi-supervised classification methods are specialized to use a very limited amount of
labeled data for training and ultimately for assigning labels to the vast majority of unlabeled …
labeled data for training and ultimately for assigning labels to the vast majority of unlabeled …
Sampling near neighbors in search for fairness
Similarity search is a fundamental algorithmic primitive, widely used in many computer
science disciplines. Given a set of points S and a radius parameter r> 0, the r-near neighbor …
science disciplines. Given a set of points S and a radius parameter r> 0, the r-near neighbor …
FairHash: A Fair and Memory/Time-efficient Hashmap
Hashmap is a fundamental data structure in computer science. There has been extensive
research on constructing hashmaps that minimize the number of collisions leading to …
research on constructing hashmaps that minimize the number of collisions leading to …