[PDF][PDF] Jukebox: A generative model for music
P Dhariwal, H Jun, C Payne, JW Kim… - arXiv preprint arXiv …, 2020 - assets.pubpub.org
We introduce Jukebox, a model that generates music with singing in the raw audio domain.
We tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete …
We tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete …
Hybrid transformers for music source separation
A natural question arising in Music Source Separation (MSS) is whether long range
contextual information is useful, or whether local acoustic features are sufficient. In other …
contextual information is useful, or whether local acoustic features are sufficient. In other …
Music source separation with band-split RNN
The performance of music source separation (MSS) models has been greatly improved in
recent years thanks to the development of novel neural network architectures and training …
recent years thanks to the development of novel neural network architectures and training …
Music source separation in the waveform domain
Source separation for music is the task of isolating contributions, or stems, from different
instruments recorded individually and arranged together to form a song. Such components …
instruments recorded individually and arranged together to form a song. Such components …
Audio features for music emotion recognition: a survey
The design of meaningful audio features is a key need to advance the state-of-the-art in
music emotion recognition (MER). This article presents a survey on the existing emotionally …
music emotion recognition (MER). This article presents a survey on the existing emotionally …
Decoupling magnitude and phase estimation with deep resunet for music source separation
Deep neural network based methods have been successfully applied to music source
separation. They typically learn a mapping from a mixture spectrogram to a set of source …
separation. They typically learn a mapping from a mixture spectrogram to a set of source …
Artificial intelligence foundation and pre-trained models: Fundamentals, applications, opportunities, and social impacts
A Kolides, A Nawaz, A Rathor, D Beeman… - … Modelling Practice and …, 2023 - Elsevier
With the emergence of foundation models (FMs) that are trained on large amounts of data at
scale and adaptable to a wide range of downstream applications, AI is experiencing a …
scale and adaptable to a wide range of downstream applications, AI is experiencing a …
Music demixing challenge 2021
Music source separation has been intensively studied in the last decade and tremendous
progress with the advent of deep learning could be observed. Evaluation campaigns such …
progress with the advent of deep learning could be observed. Evaluation campaigns such …
D3net: Densely connected multidilated densenet for music source separation
N Takahashi, Y Mitsufuji - arXiv preprint arXiv:2010.01733, 2020 - arxiv.org
Music source separation involves a large input field to model a long-term dependence of an
audio signal. Previous convolutional neural network (CNN)-based approaches address the …
audio signal. Previous convolutional neural network (CNN)-based approaches address the …
Deepsinger: Singing voice synthesis with data mined from the web
In this paper, we develop DeepSinger, a multi-lingual multi-singer singing voice synthesis
(SVS) system, which is built from scratch using singing training data mined from music …
(SVS) system, which is built from scratch using singing training data mined from music …