Top-k off-policy correction for a REINFORCE recommender system

M Chen, A Beutel, P Covington, S Jain… - Proceedings of the …, 2019 - dl.acm.org
Industrial recommender systems deal with extremely large action spaces: many millions of
items to recommend. Moreover, they need to serve billions of users, who are unique at any …
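
As a rough sketch of the correction described here, assuming the top-K factor takes the form K(1 − π)^(K−1) multiplying the usual importance ratio (the standalone helper and its names are ours, not code from the paper):

```python
def topk_correction_weight(pi_a, beta_a, K):
    """Per-example REINFORCE weight: the importance ratio pi/beta times
    the top-K correction factor K * (1 - pi)^(K - 1).
    (Our reconstruction of the paper's correction; names are ours.)"""
    importance = pi_a / beta_a                 # off-policy correction
    lambda_k = K * (1.0 - pi_a) ** (K - 1)     # top-K correction
    return importance * lambda_k

# an item the policy gives 1% probability, logged under a 2% behavior policy:
w = topk_correction_weight(pi_a=0.01, beta_a=0.02, K=16)  # ~6.9
```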

Robustness of LSTM neural networks for multi-step forecasting of chaotic time series

M Sangiorgio, F Dercole - Chaos, Solitons & Fractals, 2020 - Elsevier
Recurrent neurons (and in particular LSTM cells) have proven efficient when used as basic
blocks to build sequence-to-sequence architectures, which represent the state-of-the …
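
A minimal illustration of the multi-step setting such robustness studies stress-test: iterating a one-step predictor and feeding its predictions back in, which is where errors compound on chaotic series (the paper itself compares full LSTM seq2seq architectures, not this loop):

```python
import numpy as np

def recursive_forecast(one_step_model, window, horizon):
    """Iterate a one-step predictor `horizon` steps ahead, feeding each
    prediction back as input; error compounds through this loop."""
    window = list(window)
    preds = []
    for _ in range(horizon):
        y = float(one_step_model(np.asarray(window)))
        preds.append(y)
        window = window[1:] + [y]       # slide the input window forward
    return np.asarray(preds)

# toy usage with a persistence "model" that repeats the last value:
print(recursive_forecast(lambda w: w[-1], [0.1, 0.2, 0.3], horizon=4))
```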

Forecasting sequential data using consistent Koopman autoencoders

O Azencot, NB Erichson, V Lin… - … on Machine Learning, 2020 - proceedings.mlr.press
Recurrent neural networks are widely used on time series data, yet such models often
ignore the underlying physical structures in such sequences. A new class of physics-based …
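
A toy sketch of the consistency idea, assuming forward and backward Koopman operators C and D that should invert each other (the paper's actual loss is block-wise and trained jointly with an autoencoder; this simplified penalty is our illustration):

```python
import numpy as np

def consistency_penalty(C, D):
    """Penalize deviation of the backward operator D from inverting the
    forward Koopman operator C (simplified form of a consistency loss)."""
    I = np.eye(C.shape[0])
    return (np.linalg.norm(D @ C - I, "fro") ** 2
            + np.linalg.norm(C @ D - I, "fro") ** 2)

C = np.diag([0.9, 1.1])                           # toy forward operator
print(consistency_penalty(C, np.linalg.inv(C)))   # ~0 for a true inverse
```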

Deep learning for hand gesture recognition on skeletal data

G Devineau, F Moutarde, W Xi… - 2018 13th IEEE …, 2018 - ieeexplore.ieee.org
In this paper, we introduce a new 3D hand gesture recognition approach based on a deep
learning model. We propose a new Convolutional Neural Network (CNN) where sequences …
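
A minimal numpy sketch of the per-channel temporal-convolution idea on skeletal sequences; the paper's CNN is far deeper, with learned kernels, so shapes and names here are illustrative assumptions:

```python
import numpy as np

def temporal_conv(sequences, kernel):
    """Run the same 1-D temporal convolution independently over every
    channel of a skeletal sequence (one channel per joint coordinate)."""
    return np.stack([np.convolve(ch, kernel, mode="valid")
                     for ch in sequences])

# toy hand skeleton: 22 joints x 3 coordinates = 66 channels, 100 frames
x = np.random.randn(66, 100)
smoothed = temporal_conv(x, np.array([0.25, 0.5, 0.25]))  # shape (66, 98)
```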

Provable benefit of orthogonal initialization in optimizing deep linear networks

W Hu, L Xiao, J Pennington - arXiv preprint arXiv:2001.05992, 2020 - arxiv.org
The selection of initial parameter values for gradient-based optimization of deep neural
networks is one of the most impactful hyperparameter choices in deep learning systems …
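
Orthogonal initialization itself is easy to reproduce; a standard construction via QR decomposition of a Gaussian matrix (not code from the paper):

```python
import numpy as np

def orthogonal_init(n, rng=np.random.default_rng(0)):
    """Draw a random orthogonal matrix: QR-factor a Gaussian matrix and
    sign-correct R's diagonal so the draw is uniform over the
    orthogonal group."""
    A = rng.standard_normal((n, n))
    Q, R = np.linalg.qr(A)
    return Q * np.sign(np.diag(R))   # flips the sign of some columns

W = orthogonal_init(64)
assert np.allclose(W @ W.T, np.eye(64), atol=1e-10)
```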

Coupled oscillatory recurrent neural network (coRNN): An accurate and (gradient) stable architecture for learning long time dependencies

TK Rusch, S Mishra - arXiv preprint arXiv:2010.00951, 2020 - arxiv.org
Circuits of biological neurons, such as in the functional parts of the brain, can be modeled
as networks of coupled oscillators. Inspired by the ability of these systems to express a rich set …
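
A sketch of one coRNN step, following the explicit oscillator discretization given in the paper: the hidden state y and its velocity z evolve as damped, driven coupled oscillators (parameter names and defaults here are ours):

```python
import numpy as np

def cornn_step(y, z, u, Wy, Wz, V, b, dt=0.01, gamma=1.0, eps=1.0):
    """One explicit-scheme step of the coRNN recurrence: velocity z is
    driven by a tanh nonlinearity and damped by gamma, eps; position y
    integrates z."""
    z = z + dt * (np.tanh(Wy @ y + Wz @ z + V @ u + b) - gamma * y - eps * z)
    y = y + dt * z
    return y, z
```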

Stable recurrent models

J Miller, M Hardt - arXiv preprint arXiv:1805.10369, 2018 - arxiv.org
Stability is a fundamental property of dynamical systems, yet to date it has had little
bearing on the practice of recurrent neural networks. In this work, we conduct a thorough …
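
One common way to enforce the stability condition studied here is to project the recurrent weight matrix onto a bounded spectral norm after each update; a sketch (our construction, not the paper's code):

```python
import numpy as np

def project_to_stable(W, max_norm=0.99):
    """Project a recurrent weight matrix onto the set of matrices with
    spectral norm <= max_norm by clipping its singular values."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    return U @ np.diag(np.minimum(s, max_norm)) @ Vt
```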

Beyond exploding and vanishing gradients: analysing RNN training using attractors and smoothness

AH Ribeiro, K Tiels, LA Aguirre… - … conference on artificial …, 2020 - proceedings.mlr.press
The exploding and vanishing gradient problem has been the major conceptual principle
behind most architecture and training improvements in recurrent neural networks (RNNs) …
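
For reference, the quantity the classic exploding/vanishing analysis tracks, and which this paper argues beyond, is easy to state in code: the norm of the running product of step-to-step Jacobians (toy sketch, ours):

```python
import numpy as np

def hidden_grad_norms(jacobians):
    """Spectral norm of the running product of step Jacobians
    d h_t / d h_{t-1}; exponential growth or decay of this sequence is
    the classic exploding/vanishing-gradient diagnostic."""
    P = np.eye(jacobians[0].shape[0])
    norms = []
    for J in jacobians:
        P = J @ P
        norms.append(np.linalg.norm(P, 2))    # largest singular value
    return norms

# contracting dynamics -> vanishing gradients:
print(hidden_grad_norms([0.5 * np.eye(2)] * 5))  # 0.5, 0.25, 0.125, ...
```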

Preventing gradient explosions in gated recurrent units

S Kanai, Y Fujiwara, S Iwamura - Advances in neural …, 2017 - proceedings.neurips.cc
A gated recurrent unit (GRU) is a successful recurrent neural network architecture for time-
series data. The GRU is typically trained using a gradient-based method, which is subject to …
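
For contrast, the standard remedy in this setting is global gradient-norm clipping (generic sketch; the paper's approach instead constrains the GRU's dynamics so explosions do not arise in the first place):

```python
import numpy as np

def clip_gradient(grads, threshold):
    """Rescale a list of gradient arrays so their combined L2 norm does
    not exceed `threshold`."""
    total = np.sqrt(sum(float(np.sum(g * g)) for g in grads))
    if total > threshold:
        grads = [g * (threshold / total) for g in grads]
    return grads
```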

Values of user exploration in recommender systems

M Chen, Y Wang, C Xu, Y Le, M Sharma… - Proceedings of the 15th …, 2021 - dl.acm.org
Reinforcement Learning (RL) has been pursued for next-generation recommender
systems, to further improve user experience on recommendation platforms. While the …
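
The simplest exploration baseline in this setting is epsilon-greedy item selection; a toy sketch (the paper evaluates far richer exploration strategies, and this helper is ours):

```python
import numpy as np

def explore_recommend(scores, epsilon=0.1, rng=np.random.default_rng(0)):
    """Epsilon-greedy item selection: mostly exploit the top-scored item,
    occasionally pick a uniformly random one to explore."""
    if rng.random() < epsilon:
        return int(rng.integers(len(scores)))  # explore
    return int(np.argmax(scores))              # exploit
```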