- 学术资源搜索

Data cleaning: Overview and emerging challenges

X Chu, IF Ilyas, S Krishnan, J Wang - Proceedings of the 2016 …, 2016 - dl.acm.org

Detecting and repairing dirty data is one of the perennial challenges in data analytics, and
failure to do so can result in inaccurate analytics and unreliable decisions. Over the past few …

被引用次数：729 相关文章所有 7 个版本

[PDF] mdpi.com

To compress or not to compress—self-supervised learning and information theory: A review

R Shwartz Ziv, Y LeCun - Entropy, 2024 - mdpi.com

Deep neural networks excel in supervised learning tasks but are constrained by the need for
extensive labeled data. Self-supervised learning emerges as a promising alternative …

被引用次数：77 相关文章所有 9 个版本

[PDF] nowpublishers.com

User-friendly introduction to PAC-Bayes bounds

P Alquier - Foundations and Trends® in Machine Learning, 2024 - nowpublishers.com

Aggregated predictors are obtained by making a set of basic predictors vote according to
some weights, that is, to some probability distribution. Randomized predictors are obtained …

被引用次数：206 相关文章所有 6 个版本

[PDF] mlr.press

A finite time analysis of temporal difference learning with linear function approximation

J Bhandari, D Russo, R Singal - Conference on learning …, 2018 - proceedings.mlr.press

Temporal difference learning (TD) is a simple iterative algorithm used to estimate the value
function corresponding to a given policy in a Markov decision process. Although TD is one of …

被引用次数：438 相关文章所有 11 个版本

[PDF] acm.org

Multiaccuracy: Black-box post-processing for fairness in classification

MP Kim, A Ghorbani, J Zou - Proceedings of the 2019 AAAI/ACM …, 2019 - dl.acm.org

Prediction systems are successfully deployed in applications ranging from disease
diagnosis, to predicting credit worthiness, to image recognition. Even when the overall …

被引用次数：404 相关文章所有 7 个版本

[PDF] neurips.cc

Information-theoretic analysis of generalization capability of learning algorithms

A Xu, M Raginsky - Advances in neural information …, 2017 - proceedings.neurips.cc

We derive upper bounds on the generalization error of a learning algorithm in terms of the
mutual information between its input and output. The bounds provide an information …

被引用次数：478 相关文章所有 7 个版本

[PDF] jmlr.org

Deep exploration via randomized value functions

I Osband, B Van Roy, DJ Russo, Z Wen - Journal of Machine Learning …, 2019 - jmlr.org

We study the use of randomized value functions to guide deep exploration in reinforcement
learning. This offers an elegant means for synthesizing statistically and computationally …

被引用次数：359 相关文章所有 9 个版本

[PDF] nature.com

A translational perspective towards clinical AI fairness

M Liu, Y Ning, S Teixayavong, M Mertens, J Xu… - NPJ Digital …, 2023 - nature.com

Artificial intelligence (AI) has demonstrated the ability to extract insights from data, but the
fairness of such data-driven insights remains a concern in high-stakes fields. Despite …

被引用次数：31 相关文章所有 10 个版本

[PDF] mlr.press

Reasoning about generalization via conditional mutual information

T Steinke, L Zakynthinou - Conference on Learning Theory, 2020 - proceedings.mlr.press

We provide an information-theoretic framework for studying the generalization properties of
machine learning algorithms. Our framework ties together existing approaches, including …

被引用次数：179 相关文章所有 7 个版本

[PDF] mlr.press

Simple bayesian algorithms for best arm identification

D Russo - Conference on Learning Theory, 2016 - proceedings.mlr.press

This paper considers the optimal adaptive allocation of measurement effort for identifying the
best among a finite set of options or designs. An experimenter sequentially chooses designs …

被引用次数：345 相关文章所有 10 个版本