查看文章

epfl.ch 中的 [PDF]

Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure

作者

Afsaneh Asaei, Benjamin Picart, Hervé Bourlard

发表日期

2010/3/14

研讨会论文

2010 IEEE International Conference on Acoustics, Speech and Signal Processing

页码范围

4886-4889

出版商

IEEE

简介

Class posterior distributions have recently been used quite successfully in Automatic Speech Recognition (ASR), either for frame or phone level classification or as acoustic features, which can be further exploited (usually after some “ad hoc” transformations) in different classifiers (e.g., in Gaussian Mixture based HMMs). In the present paper, we show preliminary results showing that it may be possible to perform speech recognition without explicit subword unit (phone) classification or likelihood estimation, simply answering the question whether two acoustic (posterior) vectors belong to the same subword unit class or not. In this paper, we first exhibit specific properties of the posterior acoustic space before showing how those properties can be exploited to reach very high performance in deciding (based on an appropriate, trained, distance metric, and hypothesis testing approaches) whether two posterior vectors …

引用总数

被引用次数：25

20092010201120122013201420152016201720182019202020211 1 3 3 6 6 1 3 1

学术搜索中的文章

Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure

A Asaei, B Picart, H Bourlard - 2010 IEEE International Conference on Acoustics …, 2010

被引用次数：20 相关文章所有 14 个版本

Improved phone posterior estimation through k-NN and MLP-based similarity*

B Picart - 2009

被引用次数：8 相关文章所有 7 个版本