Feature dimensionality reduction: a review

W Jia, M Sun, J Lian, S Hou - Complex & Intelligent Systems, 2022 - Springer
As basic research, it has also received increasing attention from people that the “curse of
dimensionality” will lead to increase the cost of data storage and computing; it also …

Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Recent advances in convolutional neural networks

J Gu, Z Wang, J Kuen, L Ma, A Shahroudy, B Shuai… - Pattern recognition, 2018 - Elsevier
In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …

A streaming on-device end-to-end model surpassing server-side conventional model quality and latency

TN Sainath, Y He, B Li, A Narayanan… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art
conventional models with respect to both quality, ie, word error rate (WER), and latency, ie …

Convolutional neural networks for speech recognition

O Abdel-Hamid, A Mohamed, H Jiang… - … on audio, speech …, 2014 - ieeexplore.ieee.org
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been
shown to significantly improve speech recognition performance over the conventional …

[图书][B] Automatic speech recognition

D Yu, L Deng - 2016 - Springer
Automatic Speech Recognition (ASR), which is aimed to enable natural human–machine
interaction, has been an intensive research area for decades. Many core technologies, such …

Speech emotion recognition using deep neural network and extreme learning machine

K Han, D Yu, I Tashev - Interspeech 2014, 2014 - microsoft.com
Speech emotion recognition is a challenging problem partly because it is unclear what
features are effective for the task. In this paper we propose to utilize deep neural networks …

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

Joint optimization of masks and deep recurrent neural networks for monaural source separation

PS Huang, M Kim… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org
Monaural source separation is important for many real world applications. It is challenging
because, with only a single channel of information available, without any constraints, an …

Recent advances in deep learning for speech research at Microsoft

L Deng, J Li, JT Huang, K Yao, D Yu… - … on acoustics, speech …, 2013 - ieeexplore.ieee.org
Deep learning is becoming a mainstream technology for speech recognition at industrial
scale. In this paper, we provide an overview of the work by Microsoft speech researchers …