Is the performance of my deep network too good to be true? A direct approach to estimating...

W Wang, L Feng, Y Jiang, G Niu… - Advances in …, 2024 - proceedings.neurips.cc

Recently, learning with soft labels has been shown to achieve better performance than
learning with hard labels in terms of model generalization, calibration, and robustness …

被引用次数：5 相关文章所有 8 个版本

[PDF] neurips.cc

Demystifying the optimal performance of multi-class classification

M Jeong, M Cardone, A Dytso - Advances in Neural …, 2024 - proceedings.neurips.cc

Classification is a fundamental task in science and engineering on which machine learning
methods have shown outstanding performances. However, it is challenging to determine …

被引用次数：3 相关文章所有 4 个版本

[PDF] springer.com

Certified robust accuracy of neural networks are bounded due to bayes errors

R Zhang, J Sun - International Conference on Computer Aided …, 2024 - Springer

Adversarial examples pose a security threat to many critical systems built on neural
networks. While certified training improves robustness, it also decreases accuracy …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Unified risk analysis for weakly supervised learning

CK Chiang, M Sugiyama - arXiv preprint arXiv:2309.08216, 2023 - arxiv.org

Among the flourishing research of weakly supervised learning (WSL), we recognize the lack
of a unified interpretation of the mechanism behind the weakly supervised scenarios, let …

被引用次数：2 相关文章所有 2 个版本

[PDF] neurips.cc

Beyond probability partitions: Calibrating neural networks with semantic aware grouping

JQ Yang, DC Zhan, L Gan - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Research has shown that deep networks tend to be overly optimistic about their predictions,
leading to an underestimation of prediction errors. Due to the limited nature of data, existing …

被引用次数：5 相关文章所有 6 个版本

[PDF] arxiv.org

How Does Bayes Error Limit Probabilistic Robust Accuracy

R Zhang, J Sun - arXiv preprint arXiv:2405.14923, 2024 - arxiv.org

Adversarial examples pose a security threat to many critical systems built on neural
networks. Given that deterministic robustness often comes with significantly reduced …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Data-Driven Estimation of the False Positive Rate of the Bayes Binary Classifier via Soft Labels

M Jeong, M Cardone, A Dytso - arXiv preprint arXiv:2401.15500, 2024 - arxiv.org

Classification is a fundamental task in many applications on which data-driven methods
have shown outstanding performances. However, it is challenging to determine whether …

被引用次数：1 相关文章所有 2 个版本

Probabilistic Performance Bounds for Evaluating Depression Models Given Noisy Self-Report Labels

R Różański, E Shriberg, Y Lu, A Harati… - 2023 IEEE …, 2023 - ieeexplore.ieee.org

Advances in AI for health applications rely on evaluating performance against labeled test
data. In the area of mental health, self-report labels from surveys such as the Patient Health …

Soft ascent-descent as a stable and flexible alternative to flooding

MJ Holland, K Nakatani - The Thirty-eighth Annual Conference on …, 2024 - openreview.net

As a heuristic for improving test accuracy in classification, the" flooding" method proposed by
Ishida et al.(2020) sets a threshold for the average surrogate loss at training time; above the …