A review of deep learning techniques for speech processing
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …
learning. The use of multiple processing layers has enabled the creation of models capable …
Speaker recognition based on deep learning: An overview
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …
learning has dramatically revolutionized speaker recognition. However, there is lack of …
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …
A survey of speaker recognition: Fundamental theories, recognition methods and opportunities
Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …
digital devices. Acquiring this congenital human competency, authentication technologies …
The chime-7 dasr challenge: Distant meeting transcription with multiple devices in diverse scenarios
The CHiME challenges have played a significant role in the development and evaluation of
robust speech recognition (ASR) systems. We introduce the CHiME-7 distant ASR (DASR) …
robust speech recognition (ASR) systems. We introduce the CHiME-7 distant ASR (DASR) …
Augmentation adversarial training for self-supervised speaker recognition
The goal of this work is to train robust speaker recognition models without speaker labels.
Recent works on unsupervised speaker representations are based on contrastive learning …
Recent works on unsupervised speaker representations are based on contrastive learning …
Augmented datasheets for speech datasets and ethical decision-making
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …
the lack of diversity of the underlying training data can lead to serious limitations in building …
The voices from a distance challenge 2019 evaluation plan
The" VOiCES from a Distance Challenge 2019" is designed to foster research in the area of
speaker recognition and automatic speech recognition (ASR) with the special focus on …
speaker recognition and automatic speech recognition (ASR) with the special focus on …
The INTERSPEECH 2020 far-field speaker verification challenge
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge (FFSVC 2020)
addresses three different research problems under well-defined conditions: far-field text …
addresses three different research problems under well-defined conditions: far-field text …
[HTML][HTML] Novel speech recognition systems applied to forensics within child exploitation: Wav2vec2. 0 vs. whisper
JC Vásquez-Correa, A Álvarez Muniain - Sensors, 2023 - mdpi.com
The growth in online child exploitation material is a significant challenge for European Law
Enforcement Agencies (LEAs). One of the most important sources of such online information …
Enforcement Agencies (LEAs). One of the most important sources of such online information …