MIRNet: Learning multiple identities representations in overlapped speech

SM Kye, Y Jung, HB Lee, SJ Hwang, H Kim - arXiv preprint arXiv …, 2020 - arxiv.org

In practical settings, a speaker recognition system needs to identify a speaker given a short
utterance, while the enrollment utterance may be relatively long. However, existing speaker …

被引用次数：62 相关文章所有 9 个版本

[PDF] thecvf.com

SYENet: A simple yet effective network for multiple low-level vision tasks with real-time performance on mobile device

W Gou, Z Yi, Y Xiang, S Li, Z Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

With the rapid development of AI hardware accelerators, applying deep learning-based
algorithms to solve various low-level vision tasks on mobile devices has gradually become …

被引用次数：2 相关文章所有 5 个版本

[HTML] mdpi.com

[HTML][HTML] Deep learning-based technique for remote sensing image enhancement using multiscale feature fusion

M Zhao, R Yang, M Hu, B Liu - Sensors, 2024 - mdpi.com

The present study proposes a novel deep-learning model for remote sensing image
enhancement. It maintains image details while enhancing brightness in the feature …

被引用次数：5 相关文章所有 9 个版本

[PDF] arxiv.org

Supervised attention for speaker recognition

SM Kye, JS Chung, H Kim - 2021 IEEE Spoken Language …, 2021 - ieeexplore.ieee.org

The recently proposed self-attentive pooling (SAP) has shown good performance in several
speaker recognition systems. In SAP systems, the context vector is trained end-to-end …

被引用次数：16 相关文章所有 5 个版本

[PDF] arxiv.org

Attention-based broad self-guided network for low-light image enhancement

Z Chen, Y Liang, M Du - 2022 26th International Conference on …, 2022 - ieeexplore.ieee.org

Low-light image enhancement is widely used in many fields, such as target detection, face
recognition, and image segmentation. In recent years, Deep Learning methods have …

被引用次数：10 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] Light-Weight Speaker Verification with Global Context Information.

M Kim, Z Piao, SY Um, R Lee, J Joh, S Lee… - …, 2022 - isca-archive.org

In this paper, we propose a light-weight speaker verification (SV) system that utilizes the
characteristics of utterancelevel global features. Many recent SV tasks employ convolutional …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings

S Horiguchi, A Ando, T Moriya, T Ashihara… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper proposes a method for extracting speaker embedding for each speaker from a
variable-length recording containing multiple speakers. Speaker embeddings are crucial not …

A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction

Y Zhang, Z Li, B Liu, H Fan, Y Yang, Q Yang - International Conference on …, 2024 - Springer

Speaker extraction is a technique that separates the target speech from multi-talker mixtures
using a priori information about the target speaker, such as pre-enrolled reference speech …

被引用次数：1 相关文章所有 2 个版本