Problem formulations and solvers in linear SVM: a review

VK Chauhan, K Dahiya, A Sharma - Artificial Intelligence Review, 2019 - Springer
Support vector machine (SVM) is an optimal margin based classification technique in
machine learning. SVM is a binary linear classifier which has been extended to non-linear …

On Markov chain Monte Carlo methods for tall data

R Bardenet, A Doucet, C Holmes - Journal of Machine Learning Research, 2017 - jmlr.org
Markov chain Monte Carlo methods are often deemed too computationally intensive to be of
any practical use for big data applications, and in particular for inference on datasets …

Outrageously large neural networks: The sparsely-gated mixture-of-experts layer

N Shazeer, A Mirhoseini, K Maziarz, A Davis… - arXiv preprint arXiv …, 2017 - arxiv.org
The capacity of a neural network to absorb information is limited by its number of
parameters. Conditional computation, where parts of the network are active on a per …

[BOOK][B] Deep learning

I Goodfellow, Y Bengio, A Courville - 2016 - books.google.com
An introduction to a broad range of topics in deep learning, covering mathematical and
conceptual background, deep learning techniques used in industry, and research …

MoEL: Mixture of empathetic listeners

Z Lin, A Madotto, J Shin, P Xu, P Fung - arXiv preprint arXiv:1908.07687, 2019 - arxiv.org
Previous research on empathetic dialogue systems has mostly focused on generating
responses given certain emotions. However, being empathetic not only requires the ability of …

[BOOK][B] Deep learning

Y Bengio, I Goodfellow, A Courville - 2017 - academia.edu
Inventors have long dreamed of creating machines that think. Ancient Greek myths tell of
intelligent objects, such as animated statues of human beings and tables that arrive full of …

Extreme learning machine for regression and multiclass classification

GB Huang, H Zhou, X Ding… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
Due to the simplicity of their implementations, least square support vector machine
(LS-SVM) and proximal support vector machine (PSVM) have been widely used in binary …

[PDF][PDF] Causal discovery with continuous additive noise models

We consider the problem of learning causal directed acyclic graphs from an observational
joint distribution. One can use these graphs to predict the outcome of interventional …

Towards understanding the mixture-of-experts layer in deep learning

Z Chen, Y Deng, Y Wu, Q Gu… - Advances in neural …, 2022 - proceedings.neurips.cc
The Mixture-of-Experts (MoE) layer, a sparsely-activated model controlled by a
router, has achieved great success in deep learning. However, the understanding of such …

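[PDF][PDF] Research on extreme learning of neural networks

邓万宇, 郑庆华, 陈琳, 许学斌 - Chinese Journal of Computers (计算机学报), 2010 - cjc.ict.ac.cn
Single-hidden-layer feedforward neural networks (SLFNs) have been widely applied in
pattern recognition, automatic control, data mining, and other fields, but the speed of traditional learning methods is far …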