A survey on text-dependent and text-independent speaker verification
Speaker verification (SV) aims to detect an individual's identity from his/her voice. SV has
been successfully applied in various areas such as access control, remote service …
been successfully applied in various areas such as access control, remote service …
Disentangling voice and content with self-supervision for speaker recognition
For speaker recognition, it is difficult to extract an accurate speaker representation from
speech because of its mixture of speaker traits and content. This paper proposes a …
speech because of its mixture of speaker traits and content. This paper proposes a …
MFA: TDNN with multi-scale frequency-channel attention for text-independent speaker verification with short utterances
The time delay neural network (TDNN) represents one of the state-of-the-art of neural
solutions to text-independent speaker verification. However, they require a large number of …
solutions to text-independent speaker verification. However, they require a large number of …
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
The residual neural networks (ResNet) demonstrate the impressive performance in
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …
Neural acoustic-phonetic approach for speaker verification with phonetic attention mask
Traditional acoustic-phonetic approach makes use of both spectral and phonetic information
when comparing the voice of speakers. While phonetic units are not equally informative, the …
when comparing the voice of speakers. While phonetic units are not equally informative, the …
Phoneme-aware and channel-wise attentive learning for text dependentspeaker verification
Y Liu, Z Li, L Li, Q Hong - arXiv preprint arXiv:2106.13514, 2021 - arxiv.org
This paper proposes a multi-task learning network with phoneme-aware and channel-wise
attentive learning strategies for text-dependent Speaker Verification (SV). In the proposed …
attentive learning strategies for text-dependent Speaker Verification (SV). In the proposed …
[HTML][HTML] Class token and knowledge distillation for multi-head self-attention speaker verification systems
This paper explores three novel approaches to improve the performance of speaker
verification (SV) systems based on deep neural networks (DNN) using Multi-head Self …
verification (SV) systems based on deep neural networks (DNN) using Multi-head Self …
Speaker-utterance dual attention for speaker and utterance verification
In this paper, we study a novel technique that exploits the interaction between speaker traits
and linguistic content to improve both speaker verification and utterance verification …
and linguistic content to improve both speaker verification and utterance verification …
An adaptive X-vector model for text-independent speaker verification
In this paper, adaptive mechanisms are applied in deep neural network (DNN) training for x-
vector-based text-independent speaker verification. First, adaptive convolutional neural …
vector-based text-independent speaker verification. First, adaptive convolutional neural …
aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems
Metric learning approaches have widely expanded to the training of Speaker Verification
(SV) systems based on Deep Neural Networks (DNNs), by using a loss function more …
(SV) systems based on Deep Neural Networks (DNNs), by using a loss function more …