- 学术资源搜索

EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers

S Maiti, Y Ueda, S Watanabe, C Zhang… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

In this paper, we present a novel framework that jointly performs three tasks: speaker
diarization, speech separation, and speaker counting. Our proposed framework integrates …

被引用次数：31 相关文章所有 5 个版本

[PDF] ieee.org

Speaker counting and separation from single-channel noisy mixtures

SR Chetupalli, EAP Habets - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org

We address the problem of speaker counting and separation from a noisy, single-channel,
multi-source, recording. Most of the works in the literature assume mixtures containing two to …

被引用次数：11 相关文章所有 3 个版本

[PDF] arxiv.org

A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction

K Saijo, W Zhang, ZQ Wang… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

We propose a multi-task universal speech enhancement (MUSE) model that can perform
five speech enhancement (SE) tasks: dereverberation, denoising, speech separation (SS) …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Multi-microphone speaker separation by spatial regions

J Wechsler, SR Chetupalli, W Mack… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

We consider the task of region-based source separation of reverberant multi-microphone
recordings. We assume pre-defined spatial regions with a single active source per region …

被引用次数：9 相关文章所有 4 个版本

[PDF] arxiv.org

Boosting Unknown-Number Speaker Separation with Transformer Decoder-Based Attractor

Y Lee, S Choi, BY Kim, ZQ Wang… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

We propose a novel speech separation model designed to separate mixtures with an
unknown number of speakers. The proposed model stacks 1) a dual-path processing block …

被引用次数：7 相关文章所有 3 个版本

[PDF] arxiv.org

Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation

Y Bando, Y Masuyama, AA Nugraha… - 2023 31st European …, 2023 - ieeexplore.ieee.org

This paper describes an efficient unsupervised learning method for a neural source
separation model that utilizes a probabilistic generative model of observed multichannel …

被引用次数：4 相关文章所有 4 个版本

[PDF] uni-bremen.de

Target language extraction at multilingual cocktail parties

M Borsdorf, H Li, T Schultz - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org

Typically, target speaker extraction seeks to extract a target speaker's contribution according
to his or her individual voice characteristics. In a “multilingual cocktail party” however …

被引用次数：9 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] Speech Separation for an Unknown Number of Speakers Using Transformers With Encoder-Decoder Attractors.

SR Chetupalli, EAP Habets - INTERSPEECH, 2022 - isca-archive.org

Speaker-independent speech separation for single-channel mixtures with an unknown
number of multiple speakers in the waveform domain is considered in this paper. To deal …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Tacnet: Temporal audio source counting network

A Ahmadnejad, AM Darviishani, MM Asadi… - arXiv preprint arXiv …, 2023 - arxiv.org

In this paper, we introduce the Temporal Audio Source Counting Network (TaCNet), an
innovative architecture that addresses limitations in audio source counting tasks. TaCNet …

被引用次数：3 相关文章所有 4 个版本

[PDF] mdpi.com

Ultrasonic Through-Metal Communication Based on Deep-Learning-Assisted Echo Cancellation

J Zhang, M Jiang, J Zhang, M Gu, Z Cao - Sensors, 2024 - mdpi.com

Ultrasound is extremely efficient for wireless signal transmission through metal barriers due
to no limit of the Faraday shielding effect. Echoing in the ultrasonic channel is one of the …

被引用次数：1 相关文章所有 7 个版本