查看文章

epfl.ch 中的 [PDF]

Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition

作者

Pranay Dighe, Afsaneh Asaei, Hervé Bourlard

发表日期

2016

期刊

Speech Communication

卷号

页码范围

230-244

出版商

North-Holland

简介

In this paper, a compressive sensing (CS) perspective to exemplar-based speech processing is proposed. Relying on an analytical relationship between CS formulation and statistical speech recognition (Hidden Markov Models – HMM), the automatic speech recognition (ASR) problem is cast as recovery of high-dimensional sparse word representation from the observed low-dimensional acoustic features. The acoustic features are exemplars obtained from (deep) neural network sub-word conditional posterior probabilities. Low-dimensional word manifolds are learned using these sub-word posterior exemplars and exploited to construct a linguistic dictionary for sparse representation of word posteriors. Dictionary learning has been found to be a principled way to alleviate the need of having huge collection of exemplars as required in conventional exemplar-based approaches, while still improving the performance …

引用总数

被引用次数：29

20152016201720182019202020212022202320241 8 5 4 2 3 2 1 2 1

学术搜索中的文章

Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition

P Dighe, A Asaei, H Bourlard - Speech Communication, 2016

被引用次数：29 相关文章所有 9 个版本