A survey on text-dependent and text-independent speaker verification

Y Tu, W Lin, MW Mak - IEEE Access, 2022 - ieeexplore.ieee.org
Speaker verification (SV) aims to detect an individual's identity from his/her voice. SV has
been successfully applied in various areas such as access control, remote service …

Disentangling voice and content with self-supervision for speaker recognition

T Liu, KA Lee, Q Wang, H Li - Advances in Neural …, 2023 - proceedings.neurips.cc
For speaker recognition, it is difficult to extract an accurate speaker representation from
speech because of its mixture of speaker traits and content. This paper proposes a …

MFA: TDNN with multi-scale frequency-channel attention for text-independent speaker verification with short utterances

T Liu, RK Das, KA Lee, H Li - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
The time delay neural network (TDNN) represents one of the state-of-the-art of neural
solutions to text-independent speaker verification. However, they require a large number of …

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

T Liu, KA Lee, Q Wang, H Li - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The residual neural networks (ResNet) demonstrate the impressive performance in
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …

Neural acoustic-phonetic approach for speaker verification with phonetic attention mask

T Liu, RK Das, KA Lee, H Li - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org
Traditional acoustic-phonetic approach makes use of both spectral and phonetic information
when comparing the voice of speakers. While phonetic units are not equally informative, the …

Phoneme-aware and channel-wise attentive learning for text dependentspeaker verification

Y Liu, Z Li, L Li, Q Hong - arXiv preprint arXiv:2106.13514, 2021 - arxiv.org
This paper proposes a multi-task learning network with phoneme-aware and channel-wise
attentive learning strategies for text-dependent Speaker Verification (SV). In the proposed …

[HTML][HTML] Class token and knowledge distillation for multi-head self-attention speaker verification systems

V Mingote, A Miguel, A Ortega, E Lleida - Digital Signal Processing, 2023 - Elsevier
This paper explores three novel approaches to improve the performance of speaker
verification (SV) systems based on deep neural networks (DNN) using Multi-head Self …

Speaker-utterance dual attention for speaker and utterance verification

T Liu, RK Das, M Madhavi, S Shen, H Li - arXiv preprint arXiv:2008.08901, 2020 - arxiv.org
In this paper, we study a novel technique that exploits the interaction between speaker traits
and linguistic content to improve both speaker verification and utterance verification …

An adaptive X-vector model for text-independent speaker verification

B Gu, W Guo, L Dai, J Du - arXiv preprint arXiv:2002.06049, 2020 - arxiv.org
In this paper, adaptive mechanisms are applied in deep neural network (DNN) training for x-
vector-based text-independent speaker verification. First, adaptive convolutional neural …

aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems

V Mingote, A Miguel, D Ribas… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org
Metric learning approaches have widely expanded to the training of Speaker Verification
(SV) systems based on Deep Neural Networks (DNNs), by using a loss function more …