Using Voice Quality Features to Improve Short-Utterance, Text-Independent Speaker Verification...

L Li, R Liu, J Kang, Y Fan, H Cui, Y Cai, R Vipperla… - Speech …, 2022 - Elsevier

Research on speaker recognition is extending to address the vulnerability in the wild
conditions, among which genre mismatch is perhaps the most challenging, for instance …

被引用次数：128 相关文章所有 7 个版本

[PDF] arxiv.org

Towards understanding and mitigating audio adversarial examples for speaker recognition

G Chen, Z Zhao, F Song, S Chen, L Fan… - … on Dependable and …, 2022 - ieeexplore.ieee.org

Speaker recognition systems (SRSs) have recently been shown to be vulnerable to
adversarial attacks, raising significant security concerns. In this work, we systematically …

被引用次数：43 相关文章所有 9 个版本

[PDF] nsf.gov

Effectiveness of voice quality features in detecting depression

A Afshan, J Guo, SJ Park, V Ravi, J Flint, A Alwan - Interspeech 2018, 2018 - par.nsf.gov

Automatic assessment of depression from speech signals is affected by variabilities in
acoustic content and speakers. In this study, we focused on addressing these variabilities …

被引用次数：79 相关文章所有 7 个版本

A time-frequency channel attention and vectorization network for automatic depression level prediction

M Niu, B Liu, J Tao, Q Li - Neurocomputing, 2021 - Elsevier

Physiological studies have illustrated that speech can be used as a biomarker to analyze the
severity of depression and different frequency bands of the speech spectrum contribute …

被引用次数：27 相关文章

[PDF] academia.edu

[PDF][PDF] SEC4SR: A security analysis platform for speaker recognition

G Chen, Z Zhao, F Song, S Chen, L Fan… - arXiv preprint arXiv …, 2021 - academia.edu

Adversarial attacks have been expanded to speaker recognition (SR). However, existing
attacks are often assessed using different SR models, recognition tasks and datasets, and …

被引用次数：15 相关文章所有 3 个版本

[PDF] wiley.com Full View

Variational autoencoder for prosody‐based speaker recognition

SB Alex, L Mary - ETRI Journal, 2023 - Wiley Online Library

This paper describes a novel end‐to‐end deep generative model‐based speaker
recognition system using prosodic features. The usefulness of variational autoencoders …

被引用次数：5 相关文章所有 4 个版本

[PDF] arxiv.org

Prosodic-enhanced siamese convolutional neural networks for cross-device text-independent speaker verification

S Soleymani, A Dabouei… - 2018 IEEE 9th …, 2018 - ieeexplore.ieee.org

In this paper a novel cross-device text-independent speaker verification architecture is
proposed. Majority of the state-of-the-art deep architectures that are used for speaker …

被引用次数：22 相关文章所有 6 个版本

[PDF] academia.edu

Robust features for text-independent speaker recognition with short utterances

R Chakroun, M Frikha - Neural Computing and Applications, 2020 - Springer

Speaker recognition systems achieve good performance under controlled conditions.
However, in real-world conditions, the performance degrades drastically. The principal …

被引用次数：18 相关文章所有 5 个版本

[HTML] aip.org

[HTML][HTML] Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles

SJ Park, G Yeung, N Vesselinova, J Kreiman… - The Journal of the …, 2018 - pubs.aip.org

Little is known about human and machine speaker discrimination ability when utterances
are very short and the speaking style is variable. This study compares text-independent …

被引用次数：22 相关文章所有 12 个版本

[PDF] researchgate.net

WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals

M Niu, J Tao, Y Li, Y Qin, Y Li - IEEE Transactions on Affective …, 2023 - ieeexplore.ieee.org

Physiological reports have confirmed that there are differences in speech signals between
depressed and healthy individuals. Therefore, as an application in the field of affective …

被引用次数：3 相关文章所有 4 个版本