A parallel mixture of SVMs for very large scale problems

VK Chauhan, K Dahiya, A Sharma - Artificial Intelligence Review, 2019 - Springer

Support vector machine (SVM) is an optimal margin based classification technique in
machine learning. SVM is a binary linear classifier which has been extended to non-linear …

被引用次数：459 相关文章所有 4 个版本

[PDF] jmlr.org

On Markov chain Monte Carlo methods for tall data

R Bardenet, A Doucet, C Holmes - Journal of Machine Learning Research, 2017 - jmlr.org

Markov chain Monte Carlo methods are often deemed too computationally intensive to be of
any practical use for big data applications, and in particular for inference on datasets …

被引用次数：334 相关文章所有 12 个版本

[PDF] openreview.net

Outrageously large neural networks: The sparsely-gated mixture-of-experts layer

N Shazeer, A Mirhoseini, K Maziarz, A Davis… - arXiv preprint arXiv …, 2017 - arxiv.org

The capacity of a neural network to absorb information is limited by its number of
parameters. Conditional computation, where parts of the network are active on a per …

被引用次数：2017 相关文章所有 16 个版本

[PDF] e-hir.org

[图书][B] Deep learning

I Goodfellow, Y Bengio, A Courville - 2016 - books.google.com

An introduction to a broad range of topics in deep learning, covering mathematical and
conceptual background, deep learning techniques used in industry, and research …

被引用次数：65023 相关文章所有 25 个版本

[PDF] arxiv.org

Moel: Mixture of empathetic listeners

Z Lin, A Madotto, J Shin, P Xu, P Fung - arXiv preprint arXiv:1908.07687, 2019 - arxiv.org

Previous research on empathetic dialogue systems has mostly focused on generating
responses given certain emotions. However, being empathetic not only requires the ability of …

被引用次数：221 相关文章所有 6 个版本

[PDF] academia.edu

[图书][B] Deep learning

Y Bengio, I Goodfellow, A Courville - 2017 - academia.edu

Inventors have long dreamed of creating machines that think. Ancient Greek myths tell of
intelligent objects, such as animated statues of human beings and tables that arrive full of …

被引用次数：1973 相关文章所有 4 个版本

[PDF] infn.it

Extreme learning machine for regression and multiclass classification

GB Huang, H Zhou, X Ding… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org

Due to the simplicity of their implementations, least square support vector machine (LS-
SVM) and proximal support vector machine (PSVM) have been widely used in binary …

被引用次数：6340 相关文章所有 13 个版本

[PDF] jmlr.org

[PDF][PDF] Causal discovery with continuous additive noise models

J Peters, JM Mooij, D Janzing, B Schölkopf - 2014 - jmlr.org

We consider the problem of learning causal directed acyclic graphs from an observational
joint distribution. One can use these graphs to predict the outcome of interventional …

被引用次数：578 相关文章所有 23 个版本

[PDF] neurips.cc

Towards understanding the mixture-of-experts layer in deep learning

Z Chen, Y Deng, Y Wu, Q Gu… - Advances in neural …, 2022 - proceedings.neurips.cc

Abstract The Mixture-of-Experts (MoE) layer, a sparsely-activated model controlled by a
router, has achieved great success in deep learning. However, the understanding of such …

被引用次数：36 相关文章所有 6 个版本

[PDF] ict.ac.cn