Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Meta-learning for short utterance speaker recognition with imbalance length pairs

SM Kye, Y Jung, HB Lee, SJ Hwang, H Kim - arXiv preprint arXiv …, 2020 - arxiv.org
In practical settings, a speaker recognition system needs to identify a speaker given a short
utterance, while the enrollment utterance may be relatively long. However, existing speaker …

RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies

J Kim, H Shim, J Heo, HJ Yu - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Despite achieving satisfactory performance in speaker verification using deep neural
networks, variable-duration utterances remain a challenge that threatens the robustness of …

Online end-to-end neural diarization with speaker-tracing buffer

Y Xue, S Horiguchi, Y Fujita… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
This paper proposes a novel online speaker diarization algorithm based on a fully
supervised self-attention mechanism (SA-EEND). Online diarization inherently presents a …

Replay attack detection with complementary high-resolution information using end-to-end dnn for the asvspoof 2019 challenge

J Jung, H Shim, HS Heo, HJ Yu - arXiv preprint arXiv:1904.10134, 2019 - arxiv.org
In this study, we concentrate on replacing the process of extracting hand-crafted acoustic
feature with end-to-end DNN using complementary high-resolution spectrograms. As a …

ECAPA2: A hybrid neural network architecture and training strategy for robust speaker embeddings

J Thienpondt, K Demuynck - 2023 IEEE Automatic Speech …, 2023 - ieeexplore.ieee.org
In this paper, we present ECAPA2, a novel hybrid neural network architecture and training
strategy to produce robust speaker embeddings. Most speaker verification models are …

Knowledge distillation in acoustic scene classification

JW Jung, HS Heo, HJ Shim, HJ Yu - IEEE Access, 2020 - ieeexplore.ieee.org
Common acoustic properties that different classes share degrades the performance of
acoustic scene classification systems. This results in a phenomenon where a few confusing …

Graph attentive feature aggregation for text-independent speaker verification

H Shim, J Heo, JH Park, GH Lee… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
The objective of this paper is to combine multiple frame-level features into a single utterance-
level representation considering pair-wise relationships. For this purpose, we propose a …

Towards robust speaker verification with target speaker enhancement

C Zhang, M Yu, C Weng, D Yu - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
This paper proposes the target speaker enhancement based speaker verification network
(TASE-SVNet), an all neural model that couples target speaker enhancement and speaker …

You never know what you are going to get: Large-scale assessment of therapists' supportive counseling skill use.

X Zhang, M Tanana, L Weitzman, S Narayanan… - …, 2023 - psycnet.apa.org
Supportive counseling skills like empathy and active listening are critical ingredients of all
psychotherapies, but most research relies on client or therapist reports of the treatment …