A review on the long short-term memory model- 学术资源搜索

A review on the long short-term memory model

G Van Houdt, C Mosquera, G Nápoles - Artificial Intelligence Review, 2020 - Springer

Artificial Intelligence Review, 2020•Springer

Long short-term memory (LSTM) has transformed both machine learning and
neurocomputing fields. According to several online sources, this model has improved
Google's speech recognition, greatly improved machine translations on Google Translate,
and the answers of Amazon's Alexa. This neural system is also employed by Facebook,
reaching over 4 billion LSTM-based translations per day as of 2017. Interestingly, recurrent
neural networks had shown a rather discrete performance until LSTM showed up. One …

Abstract

Long short-term memory (LSTM) has transformed both machine learning and neurocomputing fields. According to several online sources, this model has improved Google’s speech recognition, greatly improved machine translations on Google Translate, and the answers of Amazon’s Alexa. This neural system is also employed by Facebook, reaching over 4 billion LSTM-based translations per day as of 2017. Interestingly, recurrent neural networks had shown a rather discrete performance until LSTM showed up. One reason for the success of this recurrent network lies in its ability to handle the exploding/vanishing gradient problem, which stands as a difficult issue to be circumvented when training recurrent or very deep neural networks. In this paper, we present a comprehensive review that covers LSTM’s formulation and training, relevant applications reported in the literature and code resources implementing this model for a toy example.

Springer

展开收起

被引用次数：1153 相关文章所有 9 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果