Short utterance compensation in speaker verification via cosine-based teacher-student learning...

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

被引用次数：400 相关文章所有 9 个版本

[PDF] arxiv.org

Meta-learning for short utterance speaker recognition with imbalance length pairs

SM Kye, Y Jung, HB Lee, SJ Hwang, H Kim - arXiv preprint arXiv …, 2020 - arxiv.org

In practical settings, a speaker recognition system needs to identify a speaker given a short
utterance, while the enrollment utterance may be relatively long. However, existing speaker …

被引用次数：62 相关文章所有 9 个版本

[PDF] arxiv.org

RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies

J Kim, H Shim, J Heo, HJ Yu - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Despite achieving satisfactory performance in speaker verification using deep neural
networks, variable-duration utterances remain a challenge that threatens the robustness of …

被引用次数：27 相关文章所有 5 个版本

[PDF] arxiv.org

Online end-to-end neural diarization with speaker-tracing buffer

Y Xue, S Horiguchi, Y Fujita… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org

This paper proposes a novel online speaker diarization algorithm based on a fully
supervised self-attention mechanism (SA-EEND). Online diarization inherently presents a …

被引用次数：54 相关文章所有 8 个版本

[PDF] arxiv.org

Replay attack detection with complementary high-resolution information using end-to-end dnn for the asvspoof 2019 challenge

J Jung, H Shim, HS Heo, HJ Yu - arXiv preprint arXiv:1904.10134, 2019 - arxiv.org

In this study, we concentrate on replacing the process of extracting hand-crafted acoustic
feature with end-to-end DNN using complementary high-resolution spectrograms. As a …

被引用次数：57 相关文章所有 9 个版本

[PDF] arxiv.org

ECAPA2: A hybrid neural network architecture and training strategy for robust speaker embeddings

J Thienpondt, K Demuynck - 2023 IEEE Automatic Speech …, 2023 - ieeexplore.ieee.org

In this paper, we present ECAPA2, a novel hybrid neural network architecture and training
strategy to produce robust speaker embeddings. Most speaker verification models are …

被引用次数：9 相关文章所有 3 个版本

[PDF] ieee.org

Knowledge distillation in acoustic scene classification

JW Jung, HS Heo, HJ Shim, HJ Yu - IEEE Access, 2020 - ieeexplore.ieee.org

Common acoustic properties that different classes share degrades the performance of
acoustic scene classification systems. This results in a phenomenon where a few confusing …

被引用次数：42 相关文章所有 6 个版本

[PDF] arxiv.org

Graph attentive feature aggregation for text-independent speaker verification

H Shim, J Heo, JH Park, GH Lee… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

The objective of this paper is to combine multiple frame-level features into a single utterance-
level representation considering pair-wise relationships. For this purpose, we propose a …

被引用次数：17 相关文章所有 4 个版本

[PDF] arxiv.org

Towards robust speaker verification with target speaker enhancement

C Zhang, M Yu, C Weng, D Yu - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

This paper proposes the target speaker enhancement based speaker verification network
(TASE-SVNet), an all neural model that couples target speaker enhancement and speaker …

被引用次数：20 相关文章所有 3 个版本

[HTML] nih.gov

You never know what you are going to get: Large-scale assessment of therapists' supportive counseling skill use.

X Zhang, M Tanana, L Weitzman, S Narayanan… - …, 2023 - psycnet.apa.org

Supportive counseling skills like empathy and active listening are critical ingredients of all
psychotherapies, but most research relies on client or therapist reports of the treatment …

被引用次数：8 相关文章所有 8 个版本