Occam's razor

X Hu, L Chu, J Pei, W Liu, J Bian - Knowledge and Information Systems, 2021 - Springer

Abstract Model complexity is a fundamental problem in deep learning. In this paper, we
conduct a systematic overview of the latest studies on model complexity in deep learning …

Cited by 254 · Related articles · All 6 versions

[PDF] cam.ac.uk

Probabilistic machine learning and artificial intelligence

Z Ghahramani - Nature, 2015 - nature.com

How can a machine learn from experience? Probabilistic modelling provides a framework
for understanding what learning is, and has therefore emerged as one of the principal …

Cited by 2224 · Related articles · All 22 versions

[PDF] jmlr.org

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine …, 2021 - jmlr.org

The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Cited by 687 · Related articles · All 27 versions

[PDF] neurips.cc

Bayesian deep learning and a probabilistic perspective of generalization

AG Wilson, P Izmailov - Advances in neural information …, 2020 - proceedings.neurips.cc

The key distinguishing property of a Bayesian approach is marginalization, rather than using
a single setting of weights. Bayesian marginalization can particularly improve the accuracy …

Cited by 687 · Related articles · All 6 versions

[PDF] sciencedirect.com

Hidden physics models: Machine learning of nonlinear partial differential equations

M Raissi, GE Karniadakis - Journal of Computational Physics, 2018 - Elsevier

While there is currently a lot of enthusiasm about “big data”, useful data is usually “small”
and expensive to acquire. In this paper, we present a new paradigm of learning partial …

Cited by 1274 · Related articles · All 9 versions

[PDF] academia.edu

[BOOK][B] Mathematics for machine learning

MP Deisenroth, AA Faisal, CS Ong - 2020 - books.google.com

The fundamental mathematical tools needed to understand machine learning include linear
algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability …

Cited by 793 · Related articles · All 12 versions

[PDF] arxiv.org

Sensitivity and generalization in neural networks: an empirical study

R Novak, Y Bahri, DA Abolafia, J Pennington… - arXiv preprint arXiv …, 2018 - arxiv.org

In practice it is often found that large over-parameterized neural networks generalize better
than their smaller counterparts, an observation that appears to conflict with classical notions …

Cited by 477 · Related articles · All 12 versions

[PDF] sciencedirect.com

Machine learning of linear differential equations using Gaussian processes

M Raissi, P Perdikaris, GE Karniadakis - Journal of Computational Physics, 2017 - Elsevier

This work leverages recent advances in probabilistic machine learning to discover
governing equations expressed by parametric linear operators. Such equations involve, but …

Cited by 614 · Related articles · All 6 versions

[PDF] mlr.press

Deep kernel learning

AG Wilson, Z Hu, R Salakhutdinov… - Artificial intelligence …, 2016 - proceedings.mlr.press

We introduce scalable deep kernels, which combine the structural properties of deep
learning architectures with the non-parametric flexibility of kernel methods. Specifically, we …

Cited by 998 · Related articles · All 10 versions

[PDF] rsc.org

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

RR Griffiths, JM Hernández-Lobato - Chemical science, 2020 - pubs.rsc.org

Automatic Chemical Design is a framework for generating novel molecules with optimized
properties. The original scheme, featuring Bayesian optimization over the latent space of a …

Cited by 408 · Related articles · All 13 versions

Model complexity of deep learning: A survey