Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Disentangling voice and content with self-supervision for speaker recognition

T Liu, KA Lee, Q Wang, H Li - Advances in Neural …, 2023 - proceedings.neurips.cc
For speaker recognition, it is difficult to extract an accurate speaker representation from
speech because of its mixture of speaker traits and content. This paper proposes a …

Breast cancer diagnosis using multiple activation deep neural network

K Vijayakumar, VJ Kadam… - Concurrent …, 2021 - journals.sagepub.com
Deep Neural Network (DNN) stands for multilayered Neural Network (NN) that is capable of
progressively learn the more abstract and composite representations of the raw features of …

A survey on presentation attack detection for automatic speaker verification systems: State-of-the-art, taxonomy, issues and future direction

CB Tan, MHA Hijazi, N Khamis… - Multimedia Tools and …, 2021 - Springer
The emergence of biometric technology provides enhanced security compared to the
traditional identification and authentication techniques that were less efficient and secure …

End-to-end speaker verification via curriculum bipartite ranking weighted binary cross-entropy

Z Bai, J Wang, XL Zhang, J Chen - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
End-to-end speaker verification achieves the verification through estimating directly the
similarity score between a pair of utterances, which is formulated as a binary (ie, target …

Statnet: Spectral and temporal features based multi-task network for audio spoofing detection

R Ranjan, M Vatsa, R Singh - 2022 IEEE International Joint …, 2022 - ieeexplore.ieee.org
With the rise in mobile phone users and VoIP, voice has emerged as an easy and
accessible biometric modality for identification or verification tasks. Given the increasing …

Analysis-based optimization of temporal dynamic convolutional neural network for text-independent speaker verification

SH Kim, H Nam, YH Park - IEEE Access, 2023 - ieeexplore.ieee.org
Temporal dynamic convolution neural networks (TDY-CNNs) extract speaker embeddings
considering the time-varying characteristics of speech and improve text-independent …

Phoneme-aware and channel-wise attentive learning for text dependentspeaker verification

Y Liu, Z Li, L Li, Q Hong - arXiv preprint arXiv:2106.13514, 2021 - arxiv.org
This paper proposes a multi-task learning network with phoneme-aware and channel-wise
attentive learning strategies for text-dependent Speaker Verification (SV). In the proposed …

Leveraging speaker attribute information using multi task learning for speaker verification and diarization

C Luu, P Bell, S Renals - arXiv preprint arXiv:2010.14269, 2020 - arxiv.org
Deep speaker embeddings have become the leading method for encoding speaker identity
in speaker recognition tasks. The embedding space should ideally capture the variations …

Speaker-utterance dual attention for speaker and utterance verification

T Liu, RK Das, M Madhavi, S Shen, H Li - arXiv preprint arXiv:2008.08901, 2020 - arxiv.org
In this paper, we study a novel technique that exploits the interaction between speaker traits
and linguistic content to improve both speaker verification and utterance verification …