Speaker recognition based on deep learning: An overview
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …
ECAPA-TDNN: Emphasized channel attention, propagation and aggregation in TDNN based speaker verification
Current speaker verification techniques rely on a neural network to extract speaker
representations. The successful x-vector architecture is a Time Delay Neural Network …
Deep learning methods in speaker recognition: a review
This paper summarizes the applied deep learning practices in the field of speaker
recognition, both verification and identification. Speaker recognition has been a widely used …
MFA-Conformer: Multi-scale feature aggregation conformer for automatic speaker verification
In this paper, we present Multi-scale Feature Aggregation Conformer (MFA-Conformer), an
easy-to-implement, simple but effective backbone for automatic speaker verification based …
CN-Celeb: multi-genre speaker recognition
Research on speaker recognition is extending to address the vulnerability in the wild
conditions, among which genre mismatch is perhaps the most challenging, for instance …
ECAPA-TDNN embeddings for speaker diarization
Learning robust speaker embeddings is a crucial step in speaker diarization. Deep neural
networks can accurately capture speaker discriminative characteristics and popular deep …
Integrating frequency translational invariance in TDNNs and frequency positional information in 2D ResNets to enhance speaker verification
This paper describes the IDLab submission for the text-independent task of the Short-
duration Speaker Verification Challenge 2021 (SdSVC-21). This speaker verification …
Densely Connected Time Delay Neural Network for Speaker Verification
Time delay neural network (TDNN) has been widely used in speaker verification tasks.
Recently, two TDNN-based models, including extended TDNN (E-TDNN) and factorized …
AutoSpeech: Neural architecture search for speaker recognition
Speaker recognition systems based on Convolutional Neural Networks (CNNs) are often
built with off-the-shelf backbones such as VGG-Net or ResNet. However, these backbones …
RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies
Despite achieving satisfactory performance in speaker verification using deep neural
networks, variable-duration utterances remain a challenge that threatens the robustness of …