Singer identification for metaverse with timbral and middle-level perceptual features

X Zhang, J Wang, N Cheng… - 2022 International Joint …, 2022 - ieeexplore.ieee.org
Metaverse is an interactive world that combines reality and virtuality, where participants can
be virtual avatars. Anyone can hold a concert in a virtual concert hall, and users can quickly …

Self-supervised contrastive learning for singing voices

H Yakura, K Watanabe, M Goto - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
This study introduces self-supervised contrastive learning to acquire feature representations
of singing voices. To acquire robust representations in an unsupervised manner, regular self …

An educational guide through the FMP notebooks for teaching and learning fundamentals of music processing

M Müller - Signals, 2021 - mdpi.com
This paper provides a guide through the FMP notebooks, a comprehensive collection of
educational material for teaching and learning fundamentals of music processing (FMP) with …

Deep learning approaches in topics of singing information processing

C Gupta, H Li, M Goto - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Singing, the vocal production of musical tones, is one of the most important elements of
music. Addressing the needs of real-world applications, the study of technologies related to …

MetaSID: Singer identification with domain adaptation for metaverse

X Zhang, J Wang, N Cheng… - 2022 International Joint …, 2022 - ieeexplore.ieee.org
Metaverse has stretched the real world into unlimited space. There will be more live concerts
in Metaverse. The task of singer identification is to identify which singer a song belongs to …

vocadito: A dataset of solo vocals with f0, note, and lyric annotations

RM Bittner, K Pasalo, JJ Bosch… - arXiv preprint arXiv …, 2021 - arxiv.org
To complement the existing set of datasets, we present a small dataset entitled vocadito,
consisting of 40 short excerpts of monophonic singing, sung in 7 different languages by …

Deformable CNN and imbalance-aware feature learning for singing technique classification

Y Yamamoto, J Nam, H Terasawa - arXiv preprint arXiv:2206.12230, 2022 - arxiv.org
Singing techniques are used for expressive vocal performances by employing temporal
fluctuations of the timbre, the pitch, and other components of the voice. Their classification is …

Addressing the confounds of accompaniments in singer identification

TH Hsieh, KH Cheng, ZC Fan… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Identifying singers is an important task with many applications. However, the task remains
challenging due to many issues. One major issue is related to the confounding factors from …

Learning a joint embedding space of monophonic and mixed music signals for singing voice

K Lee, J Nam - arXiv preprint arXiv:1906.11139, 2019 - arxiv.org
Previous approaches in singer identification have used one of monophonic vocal tracks or
mixed tracks containing multiple instruments, leaving a semantic gap between these two …

Semantic tagging of singing voices in popular music recordings

KL Kim, J Lee, S Kum, CL Park… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Singing voice is a key sound source in popular music. As recent music streaming and
entertainment services call for more intelligent solutions to retrieve songs or evaluate …