Authors
B Yegnanarayana, SR Mahadeva Prasanna, Ramani Duraiswami, Dmitry Zotkin
Publication date
2005/10/17
Journal
IEEE Transactions on Speech and Audio Processing
Volume
13
Issue
6
Pages
1110-1118
Publisher
IEEE
Description
In this paper, we present a method of extracting the time-delay between speech signals collected at two microphone locations. Time-delay estimation from microphone outputs is the first step for many sound localization algorithms, and also for enhancement of speech. For time-delay estimation, speech signals are normally processed using short-time spectral information (either magnitude or phase or both). The spectral features are affected by degradations in speech caused by noise and reverberation. Features corresponding to the excitation source of the speech production mechanism are robust to such degradations. We show that these source features can be extracted reliably from the speech signal. The time-delay estimate can be obtained using the features extracted even from short segments (50-100 ms) of speech from a pair of microphones. The proposed method for time-delay estimation is found to …
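As a point of reference for what "time-delay estimation" between two microphone signals means, the following is a minimal sketch of a plain cross-correlation estimator. It is not the excitation-source method the abstract proposes; it is the generic baseline in which the lag that maximizes the cross-correlation is taken as the delay. The function name `estimate_delay`, the 16 kHz sampling rate, and the synthetic white-noise test signal are assumptions made only for illustration.

```python
import numpy as np

def estimate_delay(x, y, fs):
    """Estimate the delay of y relative to x, in seconds.

    Positive result means y lags x. Generic cross-correlation
    baseline, not the source-feature method from the paper.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # corr[i] = sum_n y[n + k] * x[n], with k running over all lags
    corr = np.correlate(y, x, mode="full")
    # Lag axis matching np.correlate's "full" output order
    lags = np.arange(-(len(x) - 1), len(y))
    return lags[np.argmax(corr)] / fs

# Illustrative use on a synthetic 100 ms segment (assumed values)
fs = 16000                      # assumed sampling rate in Hz
rng = np.random.default_rng(0)
x = rng.standard_normal(1600)   # 100 ms of white noise as "mic 1"
shift = 8                       # true delay in samples
y = np.concatenate([np.zeros(shift), x[:-shift]])  # "mic 2", delayed copy
delay = estimate_delay(x, y, fs)
```

On clean synthetic data like this the peak lag recovers the inserted shift exactly; with noise and reverberation the correlation peak becomes less reliable, which is the situation the abstract's source features are meant to address.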
Total citations
[Chart: citations per year, 2005-2023]
Scholar articles
B Yegnanarayana, SRM Prasanna, R Duraiswami… - IEEE Transactions on Speech and Audio Processing, 2005