[图书][B] Audio source separation and speech enhancement

E Vincent, T Virtanen, S Gannot - 2018 - books.google.com
Learn the technology behind hearing aids, Siri, and Echo Audio source separation and
speech enhancement aim to extract one or more source signals of interest from an audio …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

Multiple-speaker localization based on direct-path features and likelihood maximization with spatial sparsity regularization

X Li, L Girin, R Horaud, S Gannot - IEEE/ACM Transactions on …, 2017 - ieeexplore.ieee.org
This paper addresses the problem of multiple-speaker localization in noisy and reverberant
environments, using binaural recordings of an acoustic scene. A complex-valued Gaussian …

A systematic review of structured sparse learning

L Qiao, B Zhang, J Su, X Lu - Frontiers of Information Technology & …, 2017 - Springer
High dimensional data arising from diverse scientific research fields and industrial
development have led to increased interest in sparse learning due to model parsimony and …

Improving end-to-end single-channel multi-talker speech recognition

W Zhang, X Chang, Y Qian… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Although significant progress has been made in single-talker automatic speech recognition
(ASR), there is still a large performance gap between multi-talker and single-talker speech …

Acoustic reflector localization: Novel image source reversion and direct localization methods

L Remaggi, PJB Jackson, P Coleman… - … /ACM Transactions on …, 2016 - ieeexplore.ieee.org
Acoustic reflector localization is an important issue in audio signal processing, with direct
applications in spatial audio, scene reconstruction, and source separation. Several methods …

Automatic intelligibility assessment of dysarthric speech using phonologically-structured sparse linear model

MJ Kim, Y Kim, H Kim - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
This paper presents a new method for automatically assessing the speech intelligibility of
patients with dysarthria, which is a motor speech disorder impeding the physical production …

Low-frequency sound source localization in enclosed space based on time reversal method

H Ma, T Shang, G Li, Z Li - Measurement, 2022 - Elsevier
This study investigates sound source localization (SSL) in enclosed space at low
frequencies. Reverberation and noise are important factors that affect localization result …

CNN-QTLBO: an optimal blind source separation and blind dereverberation scheme using lightweight CNN-QTLBO and PCDP-LDA for speech mixtures

JJC Sheeja, B Sankaragomathi - Signal, Image and Video Processing, 2022 - Springer
A microphone positioned far away observes speech signals with little acoustic interference,
in terms of both reverberation and noise. As a result, the quality of blind speech degrades …

Physics-driven inverse problems made tractable with cosparse regularization

S Kitić, L Albera, N Bertin… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Sparse data models are powerful tools for solving ill-posed inverse problems. We present a
regularization framework based on the sparse synthesis and sparse analysis models for …