Initialization of fMLLR with sufficient statistics from similar speakers

Z Zajíc, M Kunešová, V Radová - … 2016, Budapest, Hungary, August 23-27 …, 2016 - Springer

The goal of this paper is to evaluate the contribution of speaker change detection (SCD) to
the performance of a speaker diarization system in the telephone domain. We compare the …

被引用次数：27 相关文章所有 9 个版本

[PDF] ntu.edu.sg

Feature adaptation using linear spectro-temporal transform for robust speech recognition

DHH Nguyen, X Xiao, ES Chng… - IEEE/ACM Transactions …, 2016 - ieeexplore.ieee.org

Spectral information represents short-term speech information within a frame of a few tens of
milliseconds, while temporal information captures the evolution of speech statistics over …

被引用次数：17 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs.

A Prazák, Z Loose, J Trmal, JV Psutka, J Psutka - INTERSPEECH, 2012 - isca-archive.org

A novel approach to the live captioning through re-speaking is introduced in this paper. We
describe our concept of respeaking using only one re-speaker with enhanced re-speaker …

被引用次数：21 相关文章所有 3 个版本

[PDF] zcu.cz

Experiments with segmentation in an online speaker diarization system

M Kunešová, Z Zajíc, V Radová - … , Prague, Czech Republic, August 27-31 …, 2017 - Springer

In offline speaker diarization systems, particularly those aimed at telephone speech, the
accuracy of the initial segmentation of a conversation is often a secondary concern …

被引用次数：8 相关文章所有 4 个版本

[PDF] psu.edu

Captioning of live TV programs through speech recognition and re-speaking

A Pražák, Z Loose, J Trmal, JV Psutka… - Text, Speech and …, 2012 - Springer

In this paper we introduce our complete solution for captioning of live TV programs used by
the Czech Television, the public service broadcaster in the Czech Republic. Live captioning …

被引用次数：8 相关文章所有 4 个版本

[PDF] core.ac.uk

[PDF][PDF] 'DID THE SPEAKER CHANGE?': TEMPORAL TRACKING FOR OVERLAPPING SPEAKER

SINMS SCENARIOS - 2022 - core.ac.uk

Diarization systems are an essential part of many speech processing applications, such as
speaker indexing, improving automatic speech recognition (ASR) performance and making …

[PDF] psu.edu

Robust adaptation techniques dealing with small amount of data

Z Zajíc, L Machlica, L Müller - … Conference on Text, Speech and Dialogue, 2012 - Springer

The worst problem the adaptation is dealing with is the lack of adaptation data. This work
focuses on the feature Maximum Likelihood Linear Regression (fMLLR) adaptation where …

被引用次数：4 相关文章所有 6 个版本

[PDF] zcu.cz

Bottleneck ANN: Dealing with small amount of data in shift-MLLR adaptation

Z Zajíc, L Machlica, L Müller - 2012 IEEE 11th International …, 2012 - ieeexplore.ieee.org

The aim of this work is to propose a refinement of the shift-MLLR (shift Maximum Likelihood
Linear Regression) adaptation of an acoustics model in the case of limited amount of …

被引用次数：3 相关文章所有 7 个版本

Convolutional neural network for refinement of speaker adaptation transformation

Z Zajíc, J Zelinka, J Vaněk, L Müller - … 2014, Novi Sad, Serbia, October 5-9 …, 2014 - Springer

The aim of this work is to propose a refinement of the shift-MLLR (shift Maximum Likelihood
Linear Regression) adaptation of an acoustics model in the case of limited amount of …

被引用次数：2 相关文章所有 3 个版本

[PDF] zcu.cz

Vysokodimenzionální prostory a modelování v úloze rozpoznávání řečníka

L Machlica - 2013 - otik.uk.zcu.cz

The automatic speaker recognition made a significant progress in the last two decades.
Huge speech corpora containing thousands of speakers recorded on several channels are …

被引用次数：1 相关文章所有 2 个版本