CN-Celeb: multi-genre speaker recognition

L Li, R Liu, J Kang, Y Fan, H Cui, Y Cai, R Vipperla… - Speech …, 2022 - Elsevier
Research on speaker recognition is extending to address the vulnerability in the wild
conditions, among which genre mismatch is perhaps the most challenging, for instance …

Towards understanding and mitigating audio adversarial examples for speaker recognition

G Chen, Z Zhao, F Song, S Chen, L Fan… - … on Dependable and …, 2022 - ieeexplore.ieee.org
Speaker recognition systems (SRSs) have recently been shown to be vulnerable to
adversarial attacks, raising significant security concerns. In this work, we systematically …

Effectiveness of voice quality features in detecting depression

A Afshan, J Guo, SJ Park, V Ravi, J Flint, A Alwan - Interspeech 2018, 2018 - par.nsf.gov
Automatic assessment of depression from speech signals is affected by variabilities in
acoustic content and speakers. In this study, we focused on addressing these variabilities …

A time-frequency channel attention and vectorization network for automatic depression level prediction

M Niu, B Liu, J Tao, Q Li - Neurocomputing, 2021 - Elsevier
Physiological studies have illustrated that speech can be used as a biomarker to analyze the
severity of depression and different frequency bands of the speech spectrum contribute …

SEC4SR: A security analysis platform for speaker recognition

G Chen, Z Zhao, F Song, S Chen, L Fan… - arXiv preprint arXiv …, 2021 - academia.edu
Adversarial attacks have been expanded to speaker recognition (SR). However, existing
attacks are often assessed using different SR models, recognition tasks and datasets, and …

Variational autoencoder for prosody‐based speaker recognition

SB Alex, L Mary - ETRI Journal, 2023 - Wiley Online Library
This paper describes a novel end‐to‐end deep generative model‐based speaker
recognition system using prosodic features. The usefulness of variational autoencoders …

Prosodic-enhanced siamese convolutional neural networks for cross-device text-independent speaker verification

S Soleymani, A Dabouei… - 2018 IEEE 9th …, 2018 - ieeexplore.ieee.org
In this paper a novel cross-device text-independent speaker verification architecture is
proposed. Majority of the state-of-the-art deep architectures that are used for speaker …

Robust features for text-independent speaker recognition with short utterances

R Chakroun, M Frikha - Neural Computing and Applications, 2020 - Springer
Speaker recognition systems achieve good performance under controlled conditions.
However, in real-world conditions, the performance degrades drastically. The principal …

Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles

SJ Park, G Yeung, N Vesselinova, J Kreiman… - The Journal of the …, 2018 - pubs.aip.org
Little is known about human and machine speaker discrimination ability when utterances
are very short and the speaking style is variable. This study compares text-independent …

WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals

M Niu, J Tao, Y Li, Y Qin, Y Li - IEEE Transactions on Affective …, 2023 - ieeexplore.ieee.org
Physiological reports have confirmed that there are differences in speech signals between
depressed and healthy individuals. Therefore, as an application in the field of affective …