Subjective and objective quality assessment of audio source separation
We aim to assess the perceived quality of estimated source signals in the context of audio
source separation. These signals may involve one or more kinds of distortions, including …
source separation. These signals may involve one or more kinds of distortions, including …
Model-based expectation-maximization source separation and localization
This paper describes a system, referred to as model-based expectation-maximization source
separation and localization (MESSL), for separating and localizing multiple sound sources …
separation and localization (MESSL), for separating and localizing multiple sound sources …
A cross-entropy-guided measure (CEGM) for assessing speech recognition performance and optimizing DNN-based speech enhancement
A new cross-entropy-guided measure (CEGM) is proposed to indirectly assess accuracies of
automatic speech recognition (ASR) of degraded speech with a speech enhancement front …
automatic speech recognition (ASR) of degraded speech with a speech enhancement front …
Audio watermark
Y Lin, WH Abdulla - Audio Watermark A Comprehensive Foundation …, 2015 - Springer
Audio watermarking is a technique providing a promising solution to copyrights protection
for digital audio and multimedia products. Using this technique, hidden information called …
for digital audio and multimedia products. Using this technique, hidden information called …
Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks
Y Yu, W Wang, P Han - EURASIP Journal on Audio, Speech, and Music …, 2016 - Springer
Time-frequency (TF) masking is an effective method for stereo speech source separation.
However, reliable estimation of the TF mask from sound mixtures is a challenging task …
However, reliable estimation of the TF mask from sound mixtures is a challenging task …
Subjective and objective quality assessment of single-channel speech separation algorithms
Previous studies on performance evaluation of single-channel speech separation (SCSS)
algorithms mostly focused on automatic speech recognition (ASR) accuracy as their …
algorithms mostly focused on automatic speech recognition (ASR) accuracy as their …
Joint mixing vector and binaural model based stereo source separation
In this paper the mixing vector (MV) in the statistical mixing model is compared to the
binaural cues represented by interaural level and phase differences (ILD and IPD). It is …
binaural cues represented by interaural level and phase differences (ILD and IPD). It is …
Bounded generalized Gaussian mixture model with ICA
M Azam, N Bouguila - Neural Processing Letters, 2019 - Springer
In this paper, we propose bounded generalized Gaussian mixture model with independent
component analysis (ICA). One limitation in ICA is that it assumes the sources to be …
component analysis (ICA). One limitation in ICA is that it assumes the sources to be …
Reverberant speech separation with probabilistic time–frequency masking for B-format recordings
X Chen, W Wang, Y Wang, X Zhong, A Alinaghi - Speech Communication, 2015 - Elsevier
Existing speech source separation approaches overwhelmingly rely on acoustic pressure
information acquired by using a microphone array. Little attention has been devoted to the …
information acquired by using a microphone array. Little attention has been devoted to the …
Proposing a robust RLS based subband adaptive filtering for audio noise cancellation
T Bahraini, AN Sadigh - Applied Acoustics, 2024 - Elsevier
The elimination or reduction of audio signal noise and interference is a significant challenge
in signal processing. Researchers have introduced various methods, including those based …
in signal processing. Researchers have introduced various methods, including those based …