Recent progress in the MIT spoken lecture processing project.

JR Glass, TJ Hazen, DS Cyphers, I Malioutov… - Interspeech, 2007 - isca-archive.org
In this paper we discuss our research activities in the area of spoken lecture processing. Our
goal is to improve the access to on-line audio/visual recordings of academic lectures by …

Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams

Y Zhang, JR Glass - 2009 IEEE Workshop on Automatic Speech …, 2009 - ieeexplore.ieee.org
In this paper, we present an unsupervised learning framework to address the problem of
detecting spoken keywords. Without any transcription information, a Gaussian Mixture Model …

Content based lecture video retrieval using speech and video text information

H Yang, C Meinel - IEEE Transactions on Learning Technologies, 2014 - ieeexplore.ieee.org
In the last decade e-lecturing has become more and more popular. The amount of lecture
video data on the World Wide Web (WWW) is growing rapidly. Therefore, a more efficient …

Unsupervised pattern discovery in speech

AS Park, JR Glass - IEEE Transactions on Audio, Speech, and …, 2007 - ieeexplore.ieee.org
We present a novel approach to speech processing based on the principle of pattern
discovery. Our work represents a departure from traditional models of speech recognition …

Systems and methods for implicitly interpreting semantically redundant communication modes

EC Kaiser - US Patent 8,457,959, 2013 - Google Patents
(*) Notice: Subject to any disclaimer, the term of this patent is extended or adjusted under 35 …
2005/0197843 A1* 9/2005 Faisman et al. 704/276; 2005/0203738 A1* 9/2005 …

Automatic dialect detection in Arabic broadcast speech

A Ali, N Dehak, P Cardinal, S Khurana, SH Yella… - arXiv preprint arXiv …, 2015 - arxiv.org
We investigate different approaches for dialect identification in Arabic broadcast speech,
using phonetic, lexical features obtained from a speech recognition system, and acoustic …

Unsupervised lexicon discovery from acoustic input

C Lee, TJ O'Donnell, J Glass - Transactions of the Association for …, 2015 - direct.mit.edu
We present a model of unsupervised phonological lexicon discovery—the problem of
simultaneously learning phoneme-like and word-like units from acoustic input. Our model …

Speech index pruning

CI Chelba, A Acero, JFS Sanchez - US Patent 7,831,428, 2010 - Google Patents
A speech segment is indexed by identifying at least two alternative word sequences for the
speech segment. For each word in the alternative sequences, information is placed in an …

Spoken language biomarkers for detecting cognitive impairment

T Alhanai, R Au, J Glass - 2017 IEEE Automatic Speech …, 2017 - ieeexplore.ieee.org
In this study we developed an automated system that evaluates speech and language
features from audio recordings of neuropsychological examinations of 92 subjects in the …

Automatic processing of audio lectures for information retrieval: Vocabulary selection and language modeling

A Park, TJ Hazen, JR Glass - Proceedings.(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org
This paper describes our initial progress towards developing a system for automatically
transcribing and indexing audio-visual academic lectures for audio information retrieval. We …