Perceptual evaluation of blind source separation for robust speech recognition

V Emiya, E Vincent, N Harlander… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org

We aim to assess the perceived quality of estimated source signals in the context of audio
source separation. These signals may involve one or more kinds of distortions, including …

Cited by 425 · Related articles · All 13 versions

Model-based expectation-maximization source separation and localization

MI Mandel, RJ Weiss, DPW Ellis - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org

This paper describes a system, referred to as model-based expectation-maximization source
separation and localization (MESSL), for separating and localizing multiple sound sources …

Cited by 388 · Related articles · All 11 versions

A cross-entropy-guided measure (CEGM) for assessing speech recognition performance and optimizing DNN-based speech enhancement

L Chai, J Du, QF Liu, CH Lee - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org

A new cross-entropy-guided measure (CEGM) is proposed to indirectly assess accuracies of
automatic speech recognition (ASR) of degraded speech with a speech enhancement front …

Cited by 58 · Related articles · All 4 versions

Audio watermark

Y Lin, WH Abdulla - Audio Watermark A Comprehensive Foundation …, 2015 - Springer

Audio watermarking is a technique providing a promising solution to copyrights protection
for digital audio and multimedia products. Using this technique, hidden information called …

Cited by 79 · Related articles · All 9 versions

Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks

Y Yu, W Wang, P Han - EURASIP Journal on Audio, Speech, and Music …, 2016 - Springer

Time-frequency (TF) masking is an effective method for stereo speech source separation.
However, reliable estimation of the TF mask from sound mixtures is a challenging task …

Cited by 58 · Related articles · All 11 versions

Subjective and objective quality assessment of single-channel speech separation algorithms

P Mowlaee, R Saeidi, MG Christensen… - … on acoustics, speech …, 2012 - ieeexplore.ieee.org

Previous studies on performance evaluation of single-channel speech separation (SCSS)
algorithms mostly focused on automatic speech recognition (ASR) accuracy as their …

Cited by 49 · Related articles · All 13 versions

Joint mixing vector and binaural model based stereo source separation

A Alinaghi, PJB Jackson, Q Liu… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org

In this paper the mixing vector (MV) in the statistical mixing model is compared to the
binaural cues represented by interaural level and phase differences (ILD and IPD). It is …

Cited by 45 · Related articles · All 11 versions

Bounded generalized Gaussian mixture model with ICA

M Azam, N Bouguila - Neural Processing Letters, 2019 - Springer

In this paper, we propose bounded generalized Gaussian mixture model with independent
component analysis (ICA). One limitation in ICA is that it assumes the sources to be …

Cited by 27 · Related articles · All 3 versions

Reverberant speech separation with probabilistic time–frequency masking for B-format recordings

X Chen, W Wang, Y Wang, X Zhong, A Alinaghi - Speech Communication, 2015 - Elsevier

Existing speech source separation approaches overwhelmingly rely on acoustic pressure
information acquired by using a microphone array. Little attention has been devoted to the …

Cited by 36 · Related articles · All 9 versions

Proposing a robust RLS based subband adaptive filtering for audio noise cancellation

T Bahraini, AN Sadigh - Applied Acoustics, 2024 - Elsevier

The elimination or reduction of audio signal noise and interference is a significant challenge
in signal processing. Researchers have introduced various methods, including those based …

Cited by 4 · Related articles