Recovering unbalanced communities in the stochastic block model with application to clustering with a faulty oracle

CS Mukherjee, P Peng, J Zhang - Advances in Neural …, 2024 - proceedings.neurips.cc
The stochastic block model (SBM) is a fundamental model for studying graph clustering or
community detection in networks. It has received great attention in the last decade and the …

Clustering with queries under semi-random noise

A Del Pia, M Ma, C Tzamos - Conference on Learning …, 2022 - proceedings.mlr.press
The seminal paper by Mazumdar and Saha (2017a) introduced an extensive line of work on
clustering with noisy queries. Yet, despite significant progress on the problem, the proposed …

Clustering Items From Adaptively Collected Inconsistent Feedback

S Gupta, PWJ Staar… - … Conference on Artificial …, 2024 - proceedings.mlr.press
We study clustering in a query-based model where the learner can repeatedly query an
oracle to determine if two items belong to the same cluster. However, these queries are …

Gap-Free Clustering: Sensitivity and Robustness of SDP

M Zurek, Y Chen - The Thirty Seventh Annual Conference on …, 2024 - proceedings.mlr.press
We study graph clustering in the Stochastic Block Model (SBM) in the presence of both large
clusters and small, unrecoverable clusters. Previous convex relaxation approaches …

Clustering Without an Eigengap

M Zurek, Y Chen - arXiv preprint arXiv:2308.15642, 2023 - arxiv.org
We study graph clustering in the Stochastic Block Model (SBM) in the presence of both large
clusters and small, unrecoverable clusters. Previous approaches achieving exact recovery …

Error-Tolerant Exact Query Learning of Finite Set Partitions with Same-Cluster Oracle

AF DePavia, OMM del Campo, E Tani - arXiv preprint arXiv:2305.13402, 2023 - arxiv.org
This paper initiates the study of active learning for exact recovery of partitions exclusively
through access to a same-cluster oracle in the presence of bounded adversarial error. We …

Confident Clustering via PCA Compression Ratio and Its Application to Single-cell RNA-seq Analysis

Y Li, CS Mukherjee, J Zhang - arXiv preprint arXiv:2205.09849, 2022 - arxiv.org
Unsupervised clustering algorithms for vectors has been widely used in the area of machine
learning. Many applications, including the biological data we studied in this paper, contain …

Optimal Algorithms for Learning Partitions with Faulty Oracles

AF DePavia, OMM del Campo, E Tani - The Thirty-eighth Annual … - openreview.net
We consider a clustering problem where a learner seeks to partition a finite set by querying
a faulty oracle. This models applications where learners crowdsource information from non …