Binary classification with confidence difference
Recently, learning with soft labels has been shown to achieve better performance than
learning with hard labels in terms of model generalization, calibration, and robustness …
learning with hard labels in terms of model generalization, calibration, and robustness …
Demystifying the optimal performance of multi-class classification
Classification is a fundamental task in science and engineering on which machine learning
methods have shown outstanding performances. However, it is challenging to determine …
methods have shown outstanding performances. However, it is challenging to determine …
Certified robust accuracy of neural networks are bounded due to bayes errors
R Zhang, J Sun - International Conference on Computer Aided …, 2024 - Springer
Adversarial examples pose a security threat to many critical systems built on neural
networks. While certified training improves robustness, it also decreases accuracy …
networks. While certified training improves robustness, it also decreases accuracy …
Unified risk analysis for weakly supervised learning
CK Chiang, M Sugiyama - arXiv preprint arXiv:2309.08216, 2023 - arxiv.org
Among the flourishing research of weakly supervised learning (WSL), we recognize the lack
of a unified interpretation of the mechanism behind the weakly supervised scenarios, let …
of a unified interpretation of the mechanism behind the weakly supervised scenarios, let …
Beyond probability partitions: Calibrating neural networks with semantic aware grouping
Research has shown that deep networks tend to be overly optimistic about their predictions,
leading to an underestimation of prediction errors. Due to the limited nature of data, existing …
leading to an underestimation of prediction errors. Due to the limited nature of data, existing …
How Does Bayes Error Limit Probabilistic Robust Accuracy
R Zhang, J Sun - arXiv preprint arXiv:2405.14923, 2024 - arxiv.org
Adversarial examples pose a security threat to many critical systems built on neural
networks. Given that deterministic robustness often comes with significantly reduced …
networks. Given that deterministic robustness often comes with significantly reduced …
Data-Driven Estimation of the False Positive Rate of the Bayes Binary Classifier via Soft Labels
Classification is a fundamental task in many applications on which data-driven methods
have shown outstanding performances. However, it is challenging to determine whether …
have shown outstanding performances. However, it is challenging to determine whether …
Probabilistic Performance Bounds for Evaluating Depression Models Given Noisy Self-Report Labels
R Różański, E Shriberg, Y Lu, A Harati… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
Advances in AI for health applications rely on evaluating performance against labeled test
data. In the area of mental health, self-report labels from surveys such as the Patient Health …
data. In the area of mental health, self-report labels from surveys such as the Patient Health …
Soft ascent-descent as a stable and flexible alternative to flooding
MJ Holland, K Nakatani - The Thirty-eighth Annual Conference on …, 2024 - openreview.net
As a heuristic for improving test accuracy in classification, the" flooding" method proposed by
Ishida et al.(2020) sets a threshold for the average surrogate loss at training time; above the …
Ishida et al.(2020) sets a threshold for the average surrogate loss at training time; above the …