[HTML][HTML] A comparison review of transfer learning and self-supervised learning: Definitions, applications, advantages and limitations
Deep learning has emerged as a powerful tool in various domains, revolutionising machine
learning research. However, one persistent challenge is the scarcity of labelled training …
learning research. However, one persistent challenge is the scarcity of labelled training …
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
The performance of spoofing countermeasure systems depends fundamentally upon the use
of sufficiently representative training data. With this usually being limited, current solutions …
of sufficiently representative training data. With this usually being limited, current solutions …
Disentangling voice and content with self-supervision for speaker recognition
For speaker recognition, it is difficult to extract an accurate speaker representation from
speech because of its mixture of speaker traits and content. This paper proposes a …
speech because of its mixture of speaker traits and content. This paper proposes a …
Large-scale self-supervised speech representation learning for automatic speaker verification
The speech representations learned from large-scale unlabeled data have shown better
generalizability than those from supervised learning and thus attract a lot of interest to be …
generalizability than those from supervised learning and thus attract a lot of interest to be …
Pushing the limits of raw waveform speaker recognition
In recent years, speaker recognition systems based on raw waveform inputs have received
increasing attention. However, the performance of such systems are typically inferior to the …
increasing attention. However, the performance of such systems are typically inferior to the …
Overview of speaker modeling and its applications: From the lens of deep speaker representation learning
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …
By thoroughly and accurately modeling this information, it can be utilized in various …
Voxsrc 2021: The third voxceleb speaker recognition challenge
The third instalment of the VoxCeleb Speaker Recognition Challenge was held in
conjunction with Interspeech 2021. The aim of this challenge was to assess how well current …
conjunction with Interspeech 2021. The aim of this challenge was to assess how well current …
Advancing speaker embedding learning: Wespeaker toolkit for research and production
Speaker modeling plays a crucial role in various tasks, and fixed-dimensional vector
representations, known as speaker embeddings, are the predominant modeling approach …
representations, known as speaker embeddings, are the predominant modeling approach …
Self-supervised speaker recognition with loss-gated learning
In self-supervised learning for speaker recognition, pseudo labels are useful as the
supervision signals. It is a known fact that a speaker recognition model doesn't always …
supervision signals. It is a known fact that a speaker recognition model doesn't always …
Self-supervised learning with cluster-aware-dino for high-performance robust speaker verification
The automatic speaker verification task has achieved great success using deep learning
approaches with a large-scale, manually annotated dataset. However, collecting a …
approaches with a large-scale, manually annotated dataset. However, collecting a …