Minimax rates of entropy estimation on large alphabets via best polynomial approximation
Consider the problem of estimating the Shannon entropy of a distribution over k elements
from n independent samples. We show that the minimax mean-square error is within the …
from n independent samples. We show that the minimax mean-square error is within the …
Universal outlier hypothesis testing
Y Li, S Nitinawarat, VV Veeravalli - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
Outlier hypothesis testing is studied in a universal setting. Multiple sequences of
observations are collected, a small subset of which are outliers. A sequence is considered …
observations are collected, a small subset of which are outliers. A sequence is considered …
Polynomial methods in statistical inference: theory and practice
This survey provides an exposition of a suite of techniques based on the theory of
polynomials, collectively referred to as polynomial methods, which have recently been …
polynomials, collectively referred to as polynomial methods, which have recently been …
Large and small deviations for statistical sequence matching
We revisit the problem of statistical sequence matching between two databases of
sequences initiated by Unnikrishnan,(2015) and derive theoretical performance guarantees …
sequences initiated by Unnikrishnan,(2015) and derive theoretical performance guarantees …
Second-order asymptotically optimal statistical classification
Motivated by real-world machine learning applications, we analyse approximations to the
non-asymptotic fundamental limits of statistical classification. In the binary version of this …
non-asymptotic fundamental limits of statistical classification. In the binary version of this …
Kernel-based tests for likelihood-free hypothesis testing
PR Gerber, T Jiang, Y Polyanskiy… - Advances in Neural …, 2023 - proceedings.neurips.cc
Given $ n $ observations from two balanced classes, consider the task of labeling an
additional $ m $ inputs that are known to all belong to\emph {one} of the two classes …
additional $ m $ inputs that are known to all belong to\emph {one} of the two classes …
Sequential classification with empirically observed statistics
Motivated by real-world machine learning applications, we consider a statistical
classification task in a sequential setting where test samples arrive sequentially. In addition …
classification task in a sequential setting where test samples arrive sequentially. In addition …
Minimax optimal testing by classification
PR Gerber, Y Han, Y Polyanskiy - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press
This paper considers an ML inspired approach to hypothesis testing known as
classifier/classification-accuracy testing (CAT). In CAT, one first trains a classifier by feeding …
classifier/classification-accuracy testing (CAT). In CAT, one first trains a classifier by feeding …
Likelihood-free hypothesis testing
PR Gerber, Y Polyanskiy - IEEE Transactions on Information …, 2024 - ieeexplore.ieee.org
Consider the problem of binary hypothesis testing. Given Z coming from either or, to decide
between the two with small probability of error it is sufficient, and in many cases necessary …
between the two with small probability of error it is sufficient, and in many cases necessary …
Statistical classification via robust hypothesis testing: Non-asymptotic and simple bounds
H Afşer - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org
We consider Bayesian multiple statistical classification problem in the case where the
unknown source distributions are estimated from the labeled training sequences, then the …
unknown source distributions are estimated from the labeled training sequences, then the …