A review of deep learning techniques for speech processing
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …
learning. The use of multiple processing layers has enabled the creation of models capable …
Cn-celeb: multi-genre speaker recognition
Research on speaker recognition is extending to address the vulnerability in the wild
conditions, among which genre mismatch is perhaps the most challenging, for instance …
conditions, among which genre mismatch is perhaps the most challenging, for instance …
Domain generalization with relaxed instance frequency-wise normalization for multi-device acoustic scene classification
While using two-dimensional convolutional neural networks (2D-CNNs) in image
processing, it is possible to manipulate domain information using channel statistics, and …
processing, it is possible to manipulate domain information using channel statistics, and …
Playing a part: Speaker verification at the movies
The goal of this work is to investigate the performance of popular speaker recognition
models on speech segments from movies, where often actors intentionally disguise their …
models on speech segments from movies, where often actors intentionally disguise their …
A model-agnostic meta-baseline method for few-shot fault diagnosis of wind turbines
X Liu, W Teng, Y Liu - Sensors, 2022 - mdpi.com
The technology of fault diagnosis is helpful to improve the reliability of wind turbines, and
further reduce the operation and maintenance cost at wind farms. However, in reality, wind …
further reduce the operation and maintenance cost at wind farms. However, in reality, wind …
Meta-generalization for domain-invariant speaker verification
Automatic speaker verification (ASV) exhibits unsatisfactory performance under domain
mismatch conditions owing to intrinsic and extrinsic factors, such as variations in speaking …
mismatch conditions owing to intrinsic and extrinsic factors, such as variations in speaking …
Domain agnostic few-shot learning for speaker verification
Deep learning models for verification systems often fail to generalize to new users and new
environments, even though they learn highly discriminative features. To address this …
environments, even though they learn highly discriminative features. To address this …
Model-Agnostic Meta-Learning for Fast Text-Dependent Speaker Embedding Adaptation
By constraining the lexical content of input speech, text-dependent speaker verification (TD-
SV) offers more reliable performance than text-independent speaker verification (TI-SV) …
SV) offers more reliable performance than text-independent speaker verification (TI-SV) …
Learning domain-invariant transformation for speaker verification
Automatic speaker verification (ASV) faces domain shift caused by the mismatch of intrinsic
and extrinsic factors such as recording device and speaking style in real-world applications …
and extrinsic factors such as recording device and speaking style in real-world applications …
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms
The ability of countermeasure models to generalize from seen speech synthesis methods to
unseen ones has been investigated in the ASVspoof challenge. However, a new mismatch …
unseen ones has been investigated in the ASVspoof challenge. However, a new mismatch …