A review of deep learning techniques for speech processing
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …
learning. The use of multiple processing layers has enabled the creation of models capable …
Music source separation with band-split RNN
The performance of music source separation (MSS) models has been greatly improved in
recent years thanks to the development of novel neural network architectures and training …
recent years thanks to the development of novel neural network architectures and training …
Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis
P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …
natural language processing and computer vision. They have achieved great success in …
The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
Supervised speech enhancement models are trained using artificially generated mixtures of
clean speech and noise signals, which may not match real-world recording conditions at test …
clean speech and noise signals, which may not match real-world recording conditions at test …
The intel neuromorphic DNS challenge
A critical enabler for progress in neuromorphic computing research is the ability to
transparently evaluate different neuromorphic solutions on important tasks and to compare …
transparently evaluate different neuromorphic solutions on important tasks and to compare …
UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures
ZQ Wang, S Watanabe - Advances in Neural Information …, 2024 - proceedings.neurips.cc
In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …
mixture signal of multiple speakers at a different location. In over-determined conditions …
Exploring wavlm on speech enhancement
There is a surge in interest in self-supervised learning approaches for end-to-end speech
encoding in recent years as they have achieved great success. Especially, WavLM showed …
encoding in recent years as they have achieved great success. Especially, WavLM showed …
Efficient monaural speech enhancement with universal sample rate band-split RNN
While recent developments on the design of neural networks have greatly advanced the
state-of-the-art of speech enhancement and separation systems, practical applications of …
state-of-the-art of speech enhancement and separation systems, practical applications of …
Tokensplit: Using discrete speech representations for direct, refined, and transcript-conditioned speech separation and recognition
We present TokenSplit, a speech separation model that acts on discrete token sequences.
The model is trained on multiple tasks simultaneously: separate and transcribe each speech …
The model is trained on multiple tasks simultaneously: separate and transcribe each speech …
Speech separation with large-scale self-supervised learning
Self-supervised learning (SSL) methods such as WavLM have shown promising speech
separation (SS) results in small-scale simulation-based experiments. In this work, we extend …
separation (SS) results in small-scale simulation-based experiments. In this work, we extend …