Sparse hidden Markov models for purer clusters

F Deng, C Bao, WB Kleijn - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org

We propose a sparse hidden Markov model (HMM)-based single-channel speech
enhancement method that models the speech and noise gains accurately in non-stationary …

被引用次数：42 相关文章所有 3 个版本

[PDF] nsf.gov

A DNN-HMM-DNN hybrid model for discovering word-like units from spoken captions and image regions

L Wang, M Hasegawa-Johnson - Interspeech, 2020 - par.nsf.gov

Discovering word-like units without textual transcriptions is an important step in low-resource
speech technology. In this work, we demonstrate a model inspired by statistical machine …

被引用次数：12 相关文章所有 9 个版本

[PDF] isca-archive.org

[PDF][PDF] Multimodal Word Discovery and Retrieval with Phone Sequence and Image Concepts.

L Wang, MA Hasegawa-Johnson - INTERSPEECH, 2019 - isca-archive.org

This paper demonstrates three different systems capable of performing the multimodal word
discovery task. A multimodal word discovery system accepts, as input, a database of spoken …

被引用次数：6 相关文章所有 5 个版本

Sparse HMM-based speech enhancement method for stationary and non-stationary noise environments

F Deng, C Bao, WB Kleijn - 2015 IEEE International …, 2015 - ieeexplore.ieee.org

We propose a sparse hidden Markov model (HMM)-based single-channel speech
enhancement method that models the speech and noise gains accurately in both stationary …

被引用次数：10 相关文章所有 3 个版本

[PDF] aclanthology.org

[PDF][PDF] A PAC-Bayesian approach to minimum perplexity language modeling

S Bharadwaj… - Proceedings of COLING …, 2014 - aclanthology.org

Despite the overwhelming use of statistical language models in speech recognition,
machine translation, and several other domains, few high probability guarantees exist on …

被引用次数：3 相关文章所有 4 个版本

[PDF] nsf.gov

Multimodal word discovery and retrieval with spoken descriptions and visual concepts

L Wang, M Hasegawa-Johnson - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org

In the absence of dictionaries, translators, or grammars, it is still possible to learn some of
the words of a new language by listening to spoken descriptions of images. If several …

被引用次数：1 相关文章所有 4 个版本

[PDF] illinois.edu

[图书][B] A theory of (almost) zero resource speech recognition

SS Bharadwaj - 2015 - search.proquest.com

Automatic speech recognition has matured into a commercially successful technology,
enabling voice-based interfaces for smartphones, smart TVs, and many other consumer …

被引用次数：2 相关文章所有 2 个版本

[PDF] illinois.edu

Dynamics on networks

L Wang - 2020 - ideals.illinois.edu

Abstract" The main focus of this thesis is to study the stability of fix points for a dynamical
system. In the first part, we consider two dynamical models whose underlying graph can be …

An Approach for Speech Recognition Technique

MS Basha, BR Subbaiah… - i-manager's Journal on …, 2014 - search.proquest.com

Abstract The design of Speech Recognition system has careful attention in the following
issues: classification of various types of speech classes, speech representation, and feature …

[PDF] illinois.edu

[PDF][PDF] Mark Hasegawa-Johnson

S Xi, PB Kappa - linguistics.illinois.edu

2. T. Taniguchi, MA Johnson, and Y. Ohta,“Multi-vector pitch-orthogonal LPC: quality speech
with low complexity at rates between 4 and 8 kbps,” ICSLP, Kobe, pp. 113-116, 1990. 3. MA …