Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Discriminative neural embedding learning for short-duration text-independent speaker verification

S Wang, Z Huang, Y Qian, K Yu - IEEE/ACM Transactions on …, 2019 - ieeexplore.ieee.org
Short duration text-independent speaker verification remains a hot research topic in recent
years, and deep neural network based embeddings have shown impressive results in such …

Re-examining the robustness of voice features in predicting depression: Compared with baseline of confounders

W Pan, J Flint, L Shenhav, T Liu, M Liu, B Hu, T Zhu - PloS one, 2019 - journals.plos.org
A large proportion of Depression Disorder patients do not receive an effective diagnosis,
which makes it necessary to find a more objective assessment to facilitate a more rapid and …

Development of security systems using DNN and i & x-vector classifiers

O Mamyrbayev, A Kydyrbekova, K Alimhan… - 2021 - dspace.enu.kz
The widespread use of biometric systems entails increased interest from cybercriminals
aimed at developing attacks to crack them. Thus, the development of biometric identification …

Deep neural network based i-vector mapping for speaker verification using short utterances

J Guo, N Xu, K Qian, Y Shi, K Xu, Y Wu, A Alwan - Speech Communication, 2018 - Elsevier
Text-independent speaker recognition using short utterances is a highly challenging task
due to the large variation and content mismatch between short utterances. I-vector and …

Exploring the use of an unsupervised autoregressive model as a shared encoder for text-dependent speaker verification

V Ravi, R Fan, A Afshan, H Lu, A Alwan - arXiv preprint arXiv:2008.03615, 2020 - arxiv.org
In this paper, we propose a novel way of addressing text-dependent automatic speaker
verification (TD-ASV) by using a shared-encoder with task-specific decoders. An …

Deep discriminant analysis for i-vector based robust speaker recognition

S Wang, Z Huang, Y Qian, K Yu - 2018 11th International …, 2018 - ieeexplore.ieee.org
Linear Discriminant Analysis (LDA) has been used as a standard post-processing procedure
in many state-of-the-art speaker recognition tasks. Through maximizing the inter-speaker …

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

A Afshan, J Guo, SJ Park, V Ravi, A McCree… - arXiv preprint arXiv …, 2020 - arxiv.org
The effects of speaking-style variability on automatic speaker verification were investigated
using the UCLA Speaker Variability database which comprises multiple speaking styles per …

[PDF][PDF] Artificial Bandwidth Extension with Memory Inclusion Using Semi-supervised Stacked Auto-encoders.

PB Bachhav, M Todisco, NWD Evans - INTERSPEECH, 2018 - researchgate.net
Artificial bandwidth extension (ABE) algorithms have been developed to improve quality
when wideband devices receive speech signals from narrowband devices or infrastructure …

Data augmentation with moment-matching networks for i-vector based speaker verification

S Shiota, S Takamichi, T Matsui - 2018 Asia-Pacific Signal and …, 2018 - ieeexplore.ieee.org
This paper proposes an i-vector generation scheme with conditional generative moment-
matching networks (MMNs) for speaker verification. In this scheme, multiple i-vectors for …