Adversarial training for multi-domain speaker recognition

Q Wang, K Okabe, KA Lee… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

State-of-the-art speaker recognition systems comprise a speaker embedding front-end
followed by a probabilistic linear discriminant analysis (PLDA) back-end. The effectiveness …

被引用次数：7 相关文章所有 5 个版本

[PDF] ieee.org

Language agnostic speaker embedding for cross-lingual personalized speech generation

Y Zhou, X Tian, H Li - IEEE/ACM Transactions on Audio …, 2021 - ieeexplore.ieee.org

Cross-lingual personalized speech generation seeks to synthesize a target speaker's voice
from only a few training samples that are in a different language. One popular technique is to …

被引用次数：16 相关文章所有 3 个版本

[PDF] arxiv.org

Disentangled speaker representation learning via mutual information minimization

SH Mun, MH Han, M Kim, D Lee… - 2022 Asia-Pacific Signal …, 2022 - ieeexplore.ieee.org

Domain mismatch problem caused by speaker-unrelated feature has been a major topic in
speaker recognition. In this paper, we propose an explicit disentanglement framework to …

被引用次数：8 相关文章所有 7 个版本

Multi-source domain adaptation and fusion for speaker verification

D Zhu, N Chen - IEEE/ACM Transactions on Audio, Speech …, 2022 - ieeexplore.ieee.org

Since most of the conventional domain adaptation models for Speaker Verification (SV) task
apply adaptation from single source domain to the target domain, they may not be robust …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

An Investigation of Distribution Alignment in Multi-Genre Speaker Recognition

Z Zhou, J Chen, N Wang, L Li… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org

Multi-genre speaker recognition is becoming increasingly popular due to its ability to better
represent the complexities of real-world applications. However, a major challenge is the …

被引用次数：2 相关文章所有 3 个版本

CentriForce: Multiple-domain adaptation for domain-invariant speaker representation learning

Y Wei, J Du, H Liu, Z Zhang - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org

In the real world, speaker recognition systems usually suffer from serious performance
degradation due to the domain mismatch between training and test conditions. To alleviate …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-Domain Adaptation by Self-Supervised Learning for Speaker Verification

W Lin, L Li, D Wang - arXiv preprint arXiv:2309.14149, 2023 - arxiv.org

In real-world applications, speaker recognition models often face various domain-mismatch
challenges, leading to a significant drop in performance. Although numerous domain …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Adversarial data augmentation for robust speaker verification

Z Zhou, J Chen, N Wang, L Li, D Wang - Proceedings of the 2023 9th …, 2023 - dl.acm.org

Data augmentation (DA) has gained widespread popularity in deep speaker models due to
its ease of implementation and significant effectiveness. It enriches training data by …

被引用次数：2 相关文章所有 4 个版本

[PDF] mdpi.com

A Simple Unsupervised Knowledge-Free Domain Adaptation for Speaker Recognition

W Lin, L Li, D Wang - Applied Sciences, 2024 - mdpi.com

Despite the great success of speaker recognition models based on deep neural networks,
deploying a pre-trained model in real-world scenarios often leads to significant performance …

SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition

T Wang, L Li, D Wang - arXiv preprint arXiv:2406.07832, 2024 - arxiv.org

Deploying a well-optimized pre-trained speaker recognition model in a new domain often
leads to a significant decline in performance. While fine-tuning is a commonly employed …