Remixit: Continual self-training of speech enhancement models via bootstrapped remixing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

被引用次数：100 相关文章所有 6 个版本

[PDF] arxiv.org

Music source separation with band-split RNN

Y Luo, J Yu - IEEE/ACM Transactions on Audio, Speech, and …, 2023 - ieeexplore.ieee.org

The performance of music source separation (MSS) models has been greatly improved in
recent years thanks to the development of novel neural network architectures and training …

被引用次数：65 相关文章所有 4 个版本

[PDF] springer.com

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer

Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

被引用次数：14 相关文章所有 8 个版本

[PDF] arxiv.org

The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement

S Leglaive, L Borne, E Tzinis, M Sadeghi… - arXiv preprint arXiv …, 2023 - arxiv.org

Supervised speech enhancement models are trained using artificially generated mixtures of
clean speech and noise signals, which may not match real-world recording conditions at test …

被引用次数：12 相关文章所有 17 个版本

[PDF] iop.org Full View

The intel neuromorphic DNS challenge

J Timcheck, SB Shrestha, DBD Rubin… - Neuromorphic …, 2023 - iopscience.iop.org

A critical enabler for progress in neuromorphic computing research is the ability to
transparently evaluate different neuromorphic solutions on important tasks and to compare …

被引用次数：17 相关文章所有 3 个版本

[PDF] neurips.cc

UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures

ZQ Wang, S Watanabe - Advances in Neural Information …, 2024 - proceedings.neurips.cc

In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …

被引用次数：6 相关文章所有 8 个版本

[PDF] arxiv.org

Exploring wavlm on speech enhancement

H Song, S Chen, Z Chen, Y Wu… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

There is a surge in interest in self-supervised learning approaches for end-to-end speech
encoding in recent years as they have achieved great success. Especially, WavLM showed …

被引用次数：15 相关文章所有 4 个版本

Efficient monaural speech enhancement with universal sample rate band-split RNN

J Yu, Y Luo - … 2023-2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org

While recent developments on the design of neural networks have greatly advanced the
state-of-the-art of speech enhancement and separation systems, practical applications of …

被引用次数：10 相关文章

[PDF] arxiv.org

Tokensplit: Using discrete speech representations for direct, refined, and transcript-conditioned speech separation and recognition

H Erdogan, S Wisdom, X Chang, Z Borsos… - arXiv preprint arXiv …, 2023 - arxiv.org

We present TokenSplit, a speech separation model that acts on discrete token sequences.
The model is trained on multiple tasks simultaneously: separate and transcribe each speech …

被引用次数：5 相关文章所有 6 个版本

[PDF] arxiv.org

Speech separation with large-scale self-supervised learning

Z Chen, N Kanda, J Wu, Y Wu, X Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Self-supervised learning (SSL) methods such as WavLM have shown promising speech
separation (SS) results in small-scale simulation-based experiments. In this work, we extend …

被引用次数：9 相关文章所有 3 个版本