作者
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee
发表日期
2014/10/21
期刊
IEEE/ACM transactions on audio, speech, and language processing
卷号
23
期号
1
页码范围
7-19
出版商
IEEE
简介
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction techniques, we propose a supervised method to enhance speech by means of finding a mapping function between noisy and clean speech signals based on deep neural networks (DNNs). In order to be able to handle a wide range of additive noises in real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is first designed. A DNN architecture is then employed as a nonlinear regression function to ensure a powerful modeling capability. Several techniques have also been proposed to improve the DNN-based speech enhancement system, including global variance equalization to alleviate the over-smoothing problem of the regression model, and the dropout and noise-aware training strategies to further improve the generalization capability of DNNs to unseen …
引用总数
2015201620172018201920202021202220232024208411617418821520819419098
学术搜索中的文章
Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio, speech, and …, 2014