The cone of silence: Speech separation by localization
Given a multi-microphone recording of an unknown number of speakers talking
concurrently, we simultaneously localize the sources and separate the individual speakers …
concurrently, we simultaneously localize the sources and separate the individual speakers …
Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition
This paper describes multichannel speech enhancement for improving automatic speech
recognition (ASR) in noisy environments. Recently, the minimum variance distortionless …
recognition (ASR) in noisy environments. Recently, the minimum variance distortionless …
Semi-supervised multichannel speech enhancement with a deep speech prior
This paper describes a semi-supervised multichannel speech enhancement method that
uses clean speech data for prior training. Although multichannel nonnegative matrix …
uses clean speech data for prior training. Although multichannel nonnegative matrix …
Bayesian multichannel speech enhancement with a deep speech prior
This paper describes statistical multichannel speech enhancement based on a deep
generative model of speech spectra. Recently, deep neural networks (DNNs) have widely …
generative model of speech spectra. Recently, deep neural networks (DNNs) have widely …
Minimum-volume multichannel nonnegative matrix factorization for blind audio source separation
Multichannel blind audio source separation aims to recover the latent sources from their
multichannel mixtures without supervised information. One state-of-the-art blind audio …
multichannel mixtures without supervised information. One state-of-the-art blind audio …
Sound source localization using relative harmonic coefficients in modal domain
Y Hu, PN Samarasinghe… - 2019 IEEE Workshop on …, 2019 - ieeexplore.ieee.org
This paper proposes a data-driven source localization approach under a noisy and
reverberant environment, using a newly defined feature named relative harmonic …
reverberant environment, using a newly defined feature named relative harmonic …
ICA and IVA bounded multivariate generalized Gaussian mixture based hidden Markov models
Abstract Machine learning (ML), a branch of artificial intelligence (AI), is an area of
computational science that is concerned with the analysis and interpretation of patterns and …
computational science that is concerned with the analysis and interpretation of patterns and …
Deep ad-hoc beamforming based on speaker extraction for target-dependent speech separation
Recently, the research on ad-hoc microphone arrays with deep learning has drawn much
attention, especially in speech enhancement and separation. An ad-hoc microphone array …
attention, especially in speech enhancement and separation. An ad-hoc microphone array …
Synchronization of microphones based on rank minimization of warped spectrum for asynchronous distributed recording
This paper describes a new method for synchronizing microphones based on spectral
warping in an asynchronous microphone array. In an audio signal observed by an …
warping in an asynchronous microphone array. In an audio signal observed by an …
Single Channel multi-speaker speech Separation based on quantized ratio mask and residual network
S Ke, R Hu, X Wang, T Wu, G Li, Z Wang - Multimedia Tools and …, 2020 - Springer
The recently-proposed deep clustering-based algorithms represent a fundamental advance
towards the single-channel multi-speaker speech sep-aration problem. These methods use …
towards the single-channel multi-speaker speech sep-aration problem. These methods use …