Analysis and processing of lecture audio data: Preliminary investigations

JR Glass, TJ Hazen, DS Cyphers, I Malioutov… - Interspeech, 2007 - isca-archive.org

In this paper we discuss our research activities in the area of spoken lecture processing. Our
goal is to improve the access to on-line audio/visual recordings of academic lectures by …

被引用次数：241 相关文章所有 14 个版本

[PDF] mit.edu

Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams

Y Zhang, JR Glass - 2009 IEEE Workshop on Automatic Speech …, 2009 - ieeexplore.ieee.org

In this paper, we present an unsupervised learning framework to address the problem of
detecting spoken keywords. Without any transcription information, a Gaussian Mixture Model …

被引用次数：448 相关文章所有 12 个版本

[PDF] ieee.org

Content based lecture video retrieval using speech and video text information

H Yang, C Meinel - IEEE transactions on learning technologies, 2014 - ieeexplore.ieee.org

In the last decade e-lecturing has become more and more popular. The amount of lecture
video data on the World Wide Web (WWW) is growing rapidly. Therefore, a more efficient …

被引用次数：247 相关文章所有 5 个版本

[PDF] mit.edu

Unsupervised pattern discovery in speech

AS Park, JR Glass - IEEE Transactions on Audio, Speech, and …, 2007 - ieeexplore.ieee.org

We present a novel approach to speech processing based on the principle of pattern
discovery. Our work represents a departure from traditional models of speech recognition …

被引用次数：424 相关文章所有 14 个版本

[PDF] googleapis.com

Systems and methods for implicitly interpreting semantically redundant communication modes

EC Kaiser - US Patent 8,457,959, 2013 - Google Patents

(*) Notice: Subject to any disclaimer, the term of this 2005. O197843 A1* 9, 2005 Faisman et
al. TO4, 276 patent is extended or adjusted under 35 2005/0203738 A1* 9/2005 …

被引用次数：300 相关文章所有 4 个版本

[PDF] arxiv.org

Automatic dialect detection in arabic broadcast speech

A Ali, N Dehak, P Cardinal, S Khurana, SH Yella… - arXiv preprint arXiv …, 2015 - arxiv.org

We investigate different approaches for dialect identification in Arabic broadcast speech,
using phonetic, lexical features obtained from a speech recognition system, and acoustic …

被引用次数：159 相关文章所有 17 个版本

[PDF] mit.edu

Unsupervised lexicon discovery from acoustic input

C Lee, TJ O'donnell, J Glass - Transactions of the Association for …, 2015 - direct.mit.edu

We present a model of unsupervised phonological lexicon discovery—the problem of
simultaneously learning phoneme-like and word-like units from acoustic input. Our model …

被引用次数：104 相关文章所有 18 个版本

[PDF] googleapis.com

Speech index pruning

CI Chelba, A Acero, JFS Sanchez - US Patent 7,831,428, 2010 - Google Patents

A speech segment is indexed by identifying at least two alternative word sequences for the
speech segment. For each word in the alternative sequences, information is placed in an …

被引用次数：160 相关文章所有 4 个版本

[PDF] arxiv.org

Spoken language biomarkers for detecting cognitive impairment

T Alhanai, R Au, J Glass - 2017 IEEE Automatic Speech …, 2017 - ieeexplore.ieee.org

In this study we developed an automated system that evaluates speech and language
features from audio recordings of neuropsychological examinations of 92 subjects in the …

被引用次数：44 相关文章所有 11 个版本

[PDF] mit.edu

Automatic processing of audio lectures for information retrieval: Vocabulary selection and language modeling

A Park, TJ Hazen, JR Glass - Proceedings.(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org

This paper describes our initial progress towards developing a system for automatically
transcribing and indexing audio-visual academic lectures for audio information retrieval. We …

被引用次数：119 相关文章所有 13 个版本