Spleeter: a fast and efficient music source separation tool with pre-trained models

P Dhariwal, H Jun, C Payne, JW Kim… - arXiv preprint arXiv …, 2020 - assets.pubpub.org

We introduce Jukebox, a model that generates music with singing in the raw audio domain.
We tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete …

被引用次数：757 相关文章所有 8 个版本

[PDF] arxiv.org

Hybrid transformers for music source separation

S Rouard, F Massa, A Défossez - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

A natural question arising in Music Source Separation (MSS) is whether long range
contextual information is useful, or whether local acoustic features are sufficient. In other …

被引用次数：111 相关文章所有 3 个版本

[PDF] arxiv.org

Music source separation with band-split RNN

Y Luo, J Yu - IEEE/ACM Transactions on Audio, Speech, and …, 2023 - ieeexplore.ieee.org

The performance of music source separation (MSS) models has been greatly improved in
recent years thanks to the development of novel neural network architectures and training …

被引用次数：68 相关文章所有 4 个版本

[PDF] arxiv.org

Music source separation in the waveform domain

A Défossez, N Usunier, L Bottou, F Bach - arXiv preprint arXiv:1911.13254, 2019 - arxiv.org

Source separation for music is the task of isolating contributions, or stems, from different
instruments recorded individually and arranged together to form a song. Such components …

被引用次数：290 相关文章所有 13 个版本

[PDF] rcaap.pt

Audio features for music emotion recognition: a survey

R Panda, R Malheiro, RP Paiva - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

The design of meaningful audio features is a key need to advance the state-of-the-art in
music emotion recognition (MER). This article presents a survey on the existing emotionally …

被引用次数：127 相关文章所有 13 个版本

[PDF] arxiv.org

Decoupling magnitude and phase estimation with deep resunet for music source separation

Q Kong, Y Cao, H Liu, K Choi, Y Wang - arXiv preprint arXiv:2109.05418, 2021 - arxiv.org

Deep neural network based methods have been successfully applied to music source
separation. They typically learn a mapping from a mixture spectrogram to a set of source …

被引用次数：93 相关文章所有 4 个版本

Artificial intelligence foundation and pre-trained models: Fundamentals, applications, opportunities, and social impacts

A Kolides, A Nawaz, A Rathor, D Beeman… - … Modelling Practice and …, 2023 - Elsevier

With the emergence of foundation models (FMs) that are trained on large amounts of data at
scale and adaptable to a wide range of downstream applications, AI is experiencing a …

被引用次数：31 相关文章

[PDF] frontiersin.org

Music demixing challenge 2021

Y Mitsufuji, G Fabbro, S Uhlich, FR Stöter… - Frontiers in Signal …, 2022 - frontiersin.org

Music source separation has been intensively studied in the last decade and tremendous
progress with the advent of deep learning could be observed. Evaluation campaigns such …

被引用次数：79 相关文章所有 5 个版本

[PDF] arxiv.org

D3net: Densely connected multidilated densenet for music source separation

N Takahashi, Y Mitsufuji - arXiv preprint arXiv:2010.01733, 2020 - arxiv.org

Music source separation involves a large input field to model a long-term dependence of an
audio signal. Previous convolutional neural network (CNN)-based approaches address the …

被引用次数：87 相关文章所有 2 个版本

[PDF] arxiv.org

Deepsinger: Singing voice synthesis with data mined from the web

Y Ren, X Tan, T Qin, J Luan, Z Zhao… - Proceedings of the 26th …, 2020 - dl.acm.org

In this paper, we develop DeepSinger, a multi-lingual multi-singer singing voice synthesis
(SVS) system, which is built from scratch using singing training data mined from music …

被引用次数：71 相关文章所有 4 个版本