Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM

W Jia, M Sun, J Lian, S Hou - Complex & Intelligent Systems, 2022 - Springer

As basic research, it has also received increasing attention from people that the “curse of
dimensionality” will lead to increase the cost of data storage and computing; it also …

被引用次数：313 相关文章所有 4 个版本

[PDF] ieee.org

Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org

Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

被引用次数：1221 相关文章所有 9 个版本

[PDF] arxiv.org

Recent advances in convolutional neural networks

J Gu, Z Wang, J Kuen, L Ma, A Shahroudy, B Shuai… - Pattern recognition, 2018 - Elsevier

In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …

被引用次数：6250 相关文章所有 7 个版本

[PDF] arxiv.org

A streaming on-device end-to-end model surpassing server-side conventional model quality and latency

TN Sainath, Y He, B Li, A Narayanan… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art
conventional models with respect to both quality, ie, word error rate (WER), and latency, ie …

被引用次数：223 相关文章所有 10 个版本

[PDF] microsoft.com

Convolutional neural networks for speech recognition

O Abdel-Hamid, A Mohamed, H Jiang… - … on audio, speech …, 2014 - ieeexplore.ieee.org

Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been
shown to significantly improve speech recognition performance over the conventional …

被引用次数：2843 相关文章所有 10 个版本

[PDF] academia.edu

[图书][B] Automatic speech recognition

D Yu, L Deng - 2016 - Springer

Automatic Speech Recognition (ASR), which is aimed to enable natural human–machine
interaction, has been an intensive research area for decades. Many core technologies, such …

被引用次数：1522 相关文章所有 9 个版本

[PDF] microsoft.com

Speech emotion recognition using deep neural network and extreme learning machine

K Han, D Yu, I Tashev - Interspeech 2014, 2014 - microsoft.com

Speech emotion recognition is a challenging problem partly because it is unclear what
features are effective for the task. In this paper we propose to utilize deep neural networks …

被引用次数：996 相关文章所有 8 个版本

[PDF] nowpublishers.com

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com

This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

被引用次数：5902 相关文章所有 13 个版本

[PDF] arxiv.org

Joint optimization of masks and deep recurrent neural networks for monaural source separation

PS Huang, M Kim… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org

Monaural source separation is important for many real world applications. It is challenging
because, with only a single channel of information available, without any constraints, an …

被引用次数：537 相关文章所有 8 个版本

[PDF] psu.edu

Recent advances in deep learning for speech research at Microsoft

L Deng, J Li, JT Huang, K Yao, D Yu… - … on acoustics, speech …, 2013 - ieeexplore.ieee.org

Deep learning is becoming a mainstream technology for speech recognition at industrial
scale. In this paper, we provide an overview of the work by Microsoft speech researchers …

被引用次数：1037 相关文章所有 8 个版本