Fast learning rates with heavy-tailed losses

B Guedj - arXiv preprint arXiv:1901.05353, 2019 - arxiv.org

Generalised Bayesian learning algorithms are increasingly popular in machine learning,
due to their PAC generalisation properties and flexibility. The present paper aims at …

被引用次数：224 相关文章所有 8 个版本

[PDF] mlr.press

Power k-means clustering

J Xu, K Lange - International conference on machine …, 2019 - proceedings.mlr.press

Clustering is a fundamental task in unsupervised machine learning. Lloyd's 1957 algorithm
for k-means clustering remains one of the most widely used due to its speed and simplicity …

被引用次数：103 相关文章所有 2 个版本

[PDF] jmlr.org

Fast rates for general unbounded loss functions: from ERM to generalized Bayes

PD Grünwald, NA Mehta - Journal of Machine Learning Research, 2020 - jmlr.org

We present new excess risk bounds for general unbounded loss functions including log loss
and squared loss, where the distribution of the losses may be heavy-tailed. The bounds hold …

被引用次数：82 相关文章所有 6 个版本

[HTML] springer.com

[HTML][HTML] Simpler PAC-Bayesian bounds for hostile data

P Alquier, B Guedj - Machine Learning, 2018 - Springer

PAC-Bayesian learning bounds are of the utmost interest to the learning community. Their
role is to connect the generalization ability of an aggregation distribution ρ ρ to its empirical …

被引用次数：79 相关文章所有 18 个版本

[PDF] neurips.cc

-regression with Heavy-tailed Distributions

L Zhang, ZH Zhou - Advances in Neural Information …, 2018 - proceedings.neurips.cc

In this paper, we consider the problem of linear regression with heavy-tailed distributions.
Different from previous studies that use the squared loss to measure the performance, we …

被引用次数：29 相关文章所有 8 个版本

[PDF] mlr.press

Learning with non-convex truncated losses by SGD

Y Xu, S Zhu, S Yang, C Zhang… - Uncertainty in Artificial …, 2020 - proceedings.mlr.press

Learning with a convex loss function has been a dominating paradigm for many years. It
remains an interesting question how non-convex loss functions help improve the …

被引用次数：28 相关文章所有 7 个版本

[PDF] mlr.press

Uniform deviation bounds for k-means clustering

O Bachem, M Lucic, SH Hassani… - … on machine learning, 2017 - proceedings.mlr.press

Uniform deviation bounds limit the difference between a model's expected loss and its loss
on an empirical sample uniformly for all models in a learning problem. In this paper, we …

被引用次数：24 相关文章所有 8 个版本

[PDF] researchgate.net

An effective AQI estimation using sensor data and stacking mechanism

DQ Duong, QM Le, TL Nguyen-Tai… - New Trends in …, 2021 - ebooks.iospress.nl

Accurately assessing the air quality index (AQI) values and levels has become an attractive
research topic during the last decades. It is a crucial aspect when studying the possible …

被引用次数：10 相关文章所有 4 个版本

On strong consistency of kernel k-means: A Rademacher complexity approach

A Chakrabarty, S Das - Statistics & Probability Letters, 2022 - Elsevier

We provide uniform concentration bounds on the kernel k-means clustering objective based
on Rademacher complexity by posing the underlying problem as a risk minimization task …

被引用次数：5 相关文章所有 4 个版本

[PDF] arxiv.org

Bagging Improves Generalization Exponentially

H Jie, D Ying, H Lam, W Yin - arXiv preprint arXiv:2405.14741, 2024 - arxiv.org

Bagging is a popular ensemble technique to improve the accuracy of machine learning
models. It hinges on the well-established rationale that, by repeatedly retraining on …