Binary classification with confidence difference

W Wang, L Feng, Y Jiang, G Niu… - Advances in …, 2024 - proceedings.neurips.cc
Recently, learning with soft labels has been shown to achieve better performance than
learning with hard labels in terms of model generalization, calibration, and robustness …

Demystifying the optimal performance of multi-class classification

M Jeong, M Cardone, A Dytso - Advances in Neural …, 2024 - proceedings.neurips.cc
Classification is a fundamental task in science and engineering on which machine learning
methods have shown outstanding performances. However, it is challenging to determine …

Certified robust accuracy of neural networks are bounded due to bayes errors

R Zhang, J Sun - International Conference on Computer Aided …, 2024 - Springer
Adversarial examples pose a security threat to many critical systems built on neural
networks. While certified training improves robustness, it also decreases accuracy …

Unified risk analysis for weakly supervised learning

CK Chiang, M Sugiyama - arXiv preprint arXiv:2309.08216, 2023 - arxiv.org
Among the flourishing research of weakly supervised learning (WSL), we recognize the lack
of a unified interpretation of the mechanism behind the weakly supervised scenarios, let …

Beyond probability partitions: Calibrating neural networks with semantic aware grouping

JQ Yang, DC Zhan, L Gan - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Research has shown that deep networks tend to be overly optimistic about their predictions,
leading to an underestimation of prediction errors. Due to the limited nature of data, existing …

How Does Bayes Error Limit Probabilistic Robust Accuracy

R Zhang, J Sun - arXiv preprint arXiv:2405.14923, 2024 - arxiv.org
Adversarial examples pose a security threat to many critical systems built on neural
networks. Given that deterministic robustness often comes with significantly reduced …

Data-Driven Estimation of the False Positive Rate of the Bayes Binary Classifier via Soft Labels

M Jeong, M Cardone, A Dytso - arXiv preprint arXiv:2401.15500, 2024 - arxiv.org
Classification is a fundamental task in many applications on which data-driven methods
have shown outstanding performances. However, it is challenging to determine whether …

Probabilistic Performance Bounds for Evaluating Depression Models Given Noisy Self-Report Labels

R Różański, E Shriberg, Y Lu, A Harati… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
Advances in AI for health applications rely on evaluating performance against labeled test
data. In the area of mental health, self-report labels from surveys such as the Patient Health …

Soft ascent-descent as a stable and flexible alternative to flooding

MJ Holland, K Nakatani - The Thirty-eighth Annual Conference on …, 2024 - openreview.net
As a heuristic for improving test accuracy in classification, the" flooding" method proposed by
Ishida et al.(2020) sets a threshold for the average surrogate loss at training time; above the …