查看文章

ustc.edu.cn 中的 [PDF]

A regression approach to speech enhancement based on deep neural networks

作者

Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee

发表日期

2014/10/21

期刊

IEEE/ACM transactions on audio, speech, and language processing

卷号

期号

页码范围

7-19

出版商

IEEE

简介

In contrast to the conventional minimum mean square error (MMSE)-based noise reduction techniques, we propose a supervised method to enhance speech by means of finding a mapping function between noisy and clean speech signals based on deep neural networks (DNNs). In order to be able to handle a wide range of additive noises in real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is first designed. A DNN architecture is then employed as a nonlinear regression function to ensure a powerful modeling capability. Several techniques have also been proposed to improve the DNN-based speech enhancement system, including global variance equalization to alleviate the over-smoothing problem of the regression model, and the dropout and noise-aware training strategies to further improve the generalization capability of DNNs to unseen …

引用总数

被引用次数：1491

201520162017201820192020202120222023202420 84 116 174 188 215 208 194 190 98

学术搜索中的文章

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio, speech, and …, 2014

被引用次数：1491 相关文章所有 8 个版本