Efficient audio stream segmentation via the combined T² statistic and Bayesian information criterion

B Zhou, JHL Hansen - IEEE Transactions on Speech and …, 2005 - ieeexplore.ieee.org
In many speech and audio applications, it is first necessary to partition and classify acoustic
events prior to voice coding for communication or speech recognition for spoken document …

Segmentation, diarization and speech transcription: surprise data unraveled

MAH Huijbregts - 2008 - research.utwente.nl
In this thesis, research on large vocabulary continuous speech recognition for unknown
audio conditions is presented. For automatic speech recognition systems based on …

Speaker diarisation for broadcast news.

SE Tranter, DA Reynolds - Odyssey, 2004 - Citeseer
It is often important to be able to automatically label 'who spoke when' during some audio
data. This paper describes two systems for audio segmentation developed at CUED and MIT …

Mandarin–English information (MEI): investigating translingual speech retrieval

HM Meng, B Chen, S Khudanpur, GA Levow… - Computer Speech & …, 2004 - Elsevier
This paper describes the Mandarin–English Information (MEI) project, where we
investigated the problem of cross-language spoken document retrieval (CL-SDR), and …

Word topic models for spoken document retrieval and transcription

B Chen - ACM Transactions on Asian Language Information …, 2009 - dl.acm.org
Statistical language modeling (LM), which aims to capture the regularities in human natural
language and quantify the acceptability of a given word sequence, has long been an …

A discriminative HMM/N-gram-based retrieval approach for Mandarin spoken documents

B Chen, H Wang, L Lee - ACM Transactions on Asian Language …, 2004 - dl.acm.org
In recent years, statistical modeling approaches have steadily gained in popularity in the
field of information retrieval. This article presents an HMM/N-gram-based retrieval approach …

Broadcast news transcription in Mandarin.

L Chen, L Lamel, G Adda, JL Gauvain - INTERSPEECH, 2000 - isca-archive.org
In this paper, our work in developing a Mandarin broadcast news transcription system is
described. The main focus of this work is a port of the LIMSI American English broadcast …

Improved spoken document retrieval by exploring extra acoustic and linguistic cues

B Chen, H Wang, L Lee - … European Conference on …, 2001 - homepage.iis.sinica.edu.tw
In this paper, we explored the use of various extra information to improve the performance of
spoken document retrieval (SDR). From the speech recognition perspective, we …

Exploring the use of latent topical information for statistical Chinese spoken document retrieval

B Chen - Pattern Recognition Letters, 2006 - Elsevier
Information retrieval which aims to provide people with easy access to all kinds of
information is now becoming more and more emphasized. However, most approaches to …

Multi-scale-audio indexing for translingual spoken document retrieval

H Wang, H Meng, P Schone, B Chen… - 2001 IEEE International …, 2001 - ieeexplore.ieee.org
MEI (Mandarin-English Information) is an English-Chinese crosslingual spoken document
retrieval (CL-SDR) system developed during the Johns Hopkins University Summer …