Authors
Jean-Luc Gauvain, Lori Lamel, Gilles Adda
Publication date
2002/5/1
Journal
Speech Communication
Volume
37
Issue
1-2
Pages
89-108
Publisher
North-Holland
Description
This paper reports on activities at LIMSI over the last few years directed at the transcription of broadcast news data. We describe our development work in moving from laboratory read speech data to real-world or 'found' speech data in preparation for the DARPA evaluations on this task from 1996 to 1999. Two main problems needed to be addressed to deal with the continuous flow of inhomogeneous data. These concern the varied acoustic nature of the signal (signal quality, environmental and transmission noise, music) and different linguistic styles (prepared and spontaneous speech on a wide range of topics, spoken by a large variety of speakers). The problem of partitioning the continuous stream of data is addressed using an iterative segmentation and clustering algorithm with Gaussian mixtures. The speech recognizer makes use of continuous density HMMs with Gaussian mixture for acoustic modeling and 4 …
Total citations
[Bar chart: citations per year, 2002 to 2023]