[PDF][PDF] Jukebox: A generative model for music

P Dhariwal, H Jun, C Payne, JW Kim… - arXiv preprint arXiv …, 2020 - assets.pubpub.org
We introduce Jukebox, a model that generates music with singing in the raw audio domain.
We tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete …

Hybrid transformers for music source separation

S Rouard, F Massa, A Défossez - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
A natural question arising in Music Source Separation (MSS) is whether long range
contextual information is useful, or whether local acoustic features are sufficient. In other …

Music source separation with band-split RNN

Y Luo, J Yu - IEEE/ACM Transactions on Audio, Speech, and …, 2023 - ieeexplore.ieee.org
The performance of music source separation (MSS) models has been greatly improved in
recent years thanks to the development of novel neural network architectures and training …

Music source separation in the waveform domain

A Défossez, N Usunier, L Bottou, F Bach - arXiv preprint arXiv:1911.13254, 2019 - arxiv.org
Source separation for music is the task of isolating contributions, or stems, from different
instruments recorded individually and arranged together to form a song. Such components …

Audio features for music emotion recognition: a survey

R Panda, R Malheiro, RP Paiva - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The design of meaningful audio features is a key need to advance the state-of-the-art in
music emotion recognition (MER). This article presents a survey on the existing emotionally …

Decoupling magnitude and phase estimation with deep resunet for music source separation

Q Kong, Y Cao, H Liu, K Choi, Y Wang - arXiv preprint arXiv:2109.05418, 2021 - arxiv.org
Deep neural network based methods have been successfully applied to music source
separation. They typically learn a mapping from a mixture spectrogram to a set of source …

Artificial intelligence foundation and pre-trained models: Fundamentals, applications, opportunities, and social impacts

A Kolides, A Nawaz, A Rathor, D Beeman… - … Modelling Practice and …, 2023 - Elsevier
With the emergence of foundation models (FMs) that are trained on large amounts of data at
scale and adaptable to a wide range of downstream applications, AI is experiencing a …

Music demixing challenge 2021

Y Mitsufuji, G Fabbro, S Uhlich, FR Stöter… - Frontiers in Signal …, 2022 - frontiersin.org
Music source separation has been intensively studied in the last decade and tremendous
progress with the advent of deep learning could be observed. Evaluation campaigns such …

D3net: Densely connected multidilated densenet for music source separation

N Takahashi, Y Mitsufuji - arXiv preprint arXiv:2010.01733, 2020 - arxiv.org
Music source separation involves a large input field to model a long-term dependence of an
audio signal. Previous convolutional neural network (CNN)-based approaches address the …

Deepsinger: Singing voice synthesis with data mined from the web

Y Ren, X Tan, T Qin, J Luan, Z Zhao… - Proceedings of the 26th …, 2020 - dl.acm.org
In this paper, we develop DeepSinger, a multi-lingual multi-singer singing voice synthesis
(SVS) system, which is built from scratch using singing training data mined from music …